#marl

🦁 A research-friendly codebase for fast experimentation with multi-agent reinforcement learning in JAX

Python
831
7 days ago

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

Python
675
1 year ago

Multi-Agent Reinforcement Learning with JAX

Python
621
1 month ago

VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It comprises a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple, modular interface.

Python
452
2 months ago

BenchMARL is a library for benchmarking Multi-Agent Reinforcement Learning (MARL). It lets you quickly compare different MARL algorithms, tasks, and models while staying grounded in its two core tenets: reproducibility and standardization.

Python
448
1 month ago

Multi-Agent Reinforcement Learning (MARL) papers with code

385
3 years ago

A Collection of Multi-Agent Reinforcement Learning (MARL) Resources

246
3 years ago

[ICLR-2025] POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. It is a grid-based environment designed to be flexible, tunable, and scalable, and it can be tailored to a variety of PO-MAPF settings.

Python
237
1 month ago

A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.

Jupyter Notebook
149
3 days ago

A framework for creating rich, 3D, Minecraft-like single- and multi-agent environments for AI research (accepted at ICML 2025).

C++
124
7 days ago

A framework for research on multi-agent reinforcement learning, including the implementation of the experiments in the paper "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".

Python
119
9 months ago

Projects from basic algorithms to MARL. Implements MADDPG/MATD3 in Predator-Prey pursuit games with PettingZoo MPE environments.

Jupyter Notebook
106
19 days ago

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Python
85
2 years ago

An Autonomous Spectrum Management Scheme for Unmanned Aerial Vehicle Networks in Disaster Relief Operations using Multi Independent Agent Reinforcement Learning

Python
70
5 years ago

A solution for Dynamic Spectrum Management in Mission-Critical UAV Networks using Team Q learning as a Multi-Agent Reinforcement Learning Approach

Python
69
5 years ago

Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG

Python
65
6 years ago

SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning

Python
59
3 months ago