JaxMARL: multi-agent RL environments and algorithms in JAX

Rutherford, A; Ellis, B; Gallici, M; Cook, J; Lupu, A; Ingvarsson, G; Willi, T; Hammond, R; Khan, A; de Witt, CS; Souly, A; Bandyopadhyay, S; Samvelyan, M; Jiang, M; Lange, R; Whiteson, S; Lacerda, B; Hawes, N; Rocktäschel, T; Lu, C; Foerster, J

AI Collection

Conference item

JaxMARL: multi-agent RL environments and algorithms in JAX

Abstract:: Benchmarks play an important role in the development of machine learning algorithms, with reinforcement learning (RL) research having been heavily influenced by the available environments. However, RL environments are traditionally run on the CPU, limiting their scalability with typical academic compute. Recent advancements in JAX have enabled the wider use of hardware acceleration to overcome these computational hurdles, enabling massively parallel RL training pipelines and environments. This is particularly useful for multi-agent reinforcement learning (MARL) research. First of all, multiple agents must be considered at each environment step, adding computational burden, and secondly, the sample complexity is increased due to non-stationarity, decentralised partial observability, or other MARL challenges. In this paper, we present JaxMARL, the first open-source code base that combines ease-of-use with GPU enabled efficiency, and supports a large number of commonly used MARL environments as well as popular baseline algorithms. When considering wall clock time, our experiments show that per-run our JAX-based training pipeline is up to 12500x faster than existing approaches. We also introduce and benchmark SMAX, a vectorised, simplified version of the popular StarCraft Multi-Agent Challenge, which removes the need to run the StarCraft II game engine. This not only enables GPU acceleration, but also provides a more flexible MARL environment, unlocking the potential for self-play, meta-learning, and other future applications in MARL. We provide code at https://github.com/flairox/jaxmarl.

Publication status:: Published

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Share
Cite

Cite this record

APA Style

Rutherford, A., Ellis, B., Gallici, M., Cook, J., Lupu, A., Ingvarsson, G., Willi, T., Hammond, R., Khan, A., de Witt, C. S., Souly, A., Bandyopadhyay, S., Samvelyan, M., Jiang, M., Lange, R., Whiteson, S., Lacerda, B., Hawes, N., Rocktäschel, T., … Foerster, J. (2024). JaxMARL: multi-agent RL environments and algorithms in JAX. 23rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024), 2444–2446.

MLA Style

Rutherford, A, et al. “JaxMARL: Multi-Agent RL Environments and Algorithms in JAX.” 23rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024), 2024, pp. 2444–46.

Chicago Style

Rutherford, A, B Ellis, M Gallici, J Cook, A Lupu, G Ingvarsson, T Willi, et al. 2024. “JaxMARL: Multi-Agent RL Environments and Algorithms in JAX.” In 23rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024), 2444–46. Association for Computing Machinery.
Print

Access Document

Files:: Rutherford_et_al_2024_JaxMARL_multi-agent_RL.pdf

(Preview, Version of record, pdf, 4.7MB, Terms of use)

Publication website:: https://dl.acm.org/doi/abs/10.5555/3635637.3663188

Authors

+ Rutherford, A More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Oxford college:: Pembroke College
Role:: Author
ORCID:: 0000-0002-2662-5602

+ Ellis, B More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

+ Gallici, M More by this author

Role:: Author

+ Cook, J More by this author

Institution:: University of Oxford
Division:: MSD
Department:: NDORMS
Role:: Author
ORCID:: 0000-0002-4156-6989

+ Lupu, A More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

More authors...

Publisher:: Association for Computing Machinery
Host title:: AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems
Pages:: 2444-2446
Publication date:: 2024-05-06
Acceptance date:: 2023-12-20
Event title:: 23rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024)
Event location:: Auckland, New Zealand
Event website:: https://www.aamas2024-conference.auckland.ac.nz/
Event start date:: 2024-05-06
Event end date:: 2024-05-10
EISSN:: 1558-2914
ISSN:: 1548-8403
ISBN:: 9798400704864

Language:: English
Pubs id:: 1997958
Local pid:: pubs:1997958
Deposit date:: 2025-04-02
ARK identifier:: ark:/29072/ora_c55e6f681e8743ed94897eb1ac30c728

Terms of use

Copyright holder:: International Foundation for Autonomous Agents and Multiagent Systems
Rights statement:: © 2024 International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org). This work is licenced under the Creative Commons Attribution 4.0 International (CC-BY 4.0) licence.
Notes:: This paper was presented at the 23rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024), 6th-10th May 2024, Auckland, New Zealand.

This work is related to the thesis Intelligent interaction at scale.

Licence:: CC Attribution (CC BY)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Conference item

JaxMARL: multi-agent RL environments and algorithms in JAX

Actions

Access Document

Authors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Conference item

JaxMARL: multi-agent RL environments and algorithms in JAX

Actions

Access Document

Authors

Bibliographic Details

Item Description

Related Items

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions