.. meta:: :google-site-verification: pGUJ3Uw-ACan4k_IFjAEHfonB-eSEdjpkTz0yX2cc34 Welcome to CleanMARL documentation! =================================== CleanMARL provides single file implementations of Deep Multi-Agent Reinforcement Learning (MARL) algorithms. In addition, it serves as an **educational resource** for learning the algorithms and implementations of MARL. In a nutshell this project is a combination of `CleanRL `_ and `Spinning Up in Deep RL `_ but for **multi-agent RL**. Github repository: https://github.com/AmineAndam04/cleanmarl Main Features ------------- - Implementations of key MARL algorithms: VDN, QMIX, COMA, MADDPG, FACMAC, IPPO, and MAPPO. - A documentation for algorithms, code and training details. - We support parallel environments and recurrent policies. - Tensorboard and Weights & Biases logging. Contents -------- .. toctree:: :maxdepth: 2 marl cleanmarl VDN QMIX COMA MADDPG FACMAC Idependent PPO Multi-agent PPO design author