MAPPO Attention Project Repository

Welcome to the repository for our MAPPO (Multi-Agent Proximal Policy Optimization) project. It contains the code and scripts needed to run and manage experiments with MAPPO and modular attention architectures. We train our multi-agent systems on three Melting Pot scenario environments:

territory__rooms (number of agents: 9)
allelopathic_harvest__open (number of agents: 16)
prisoners_dilemma_in_the_matrix__arena (number of agents: 8)

Pretraining and Fine-Tuning

Pretraining the Slot Attention Module

We pretrain the Slot Attention module on a single environment, for example territory__rooms.

Fine-Tuning for Multi-Agent RL

We fine-tune the Slot Attention representations for the multi-agent RL task by copying the pretrained model to every agent and then fine-tuning the high-level features with LoRA.
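
As a rough illustration of the fine-tuning step, the sketch below shows the LoRA idea in dependency-free Python: the pretrained weight matrix W stays frozen, and each agent's copy carries its own trainable low-rank pair (A, B), so the effective weight is W + (alpha / r) * B A. The class name `LoRALinear`, the helper `matvec`, the rank and `alpha` values, and the toy 2x2 weight are all assumptions made for illustration; this is not the repository's implementation.

```python
import copy
import random


def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


class LoRALinear:
    """Frozen weight W plus a trainable low-rank update (alpha / r) * B @ A."""

    def __init__(self, weight, rank=1, alpha=1.0, seed=0):
        rng = random.Random(seed)
        out_dim, in_dim = len(weight), len(weight[0])
        self.weight = weight  # pretrained weights, kept frozen
        self.scale = alpha / rank
        # A starts small and random, B starts at zero, so the initial
        # output matches the pretrained layer exactly.
        self.A = [[rng.gauss(0.0, 0.01) for _ in range(in_dim)] for _ in range(rank)]
        self.B = [[0.0] * rank for _ in range(out_dim)]

    def forward(self, x):
        base = matvec(self.weight, x)
        update = matvec(self.B, matvec(self.A, x))
        return [b + self.scale * u for b, u in zip(base, update)]


# Toy "pretrained" layer, copied once per agent (9 agents in territory__rooms).
pretrained = LoRALinear([[1.0, 2.0], [3.0, 4.0]], rank=1, alpha=1.0)
agents = [copy.deepcopy(pretrained) for _ in range(9)]

# Each copy owns its adapters, so changing one agent leaves the others untouched.
agents[0].B = [[0.5], [0.5]]
agents[0].A = [[1.0, 0.0]]
```

In this toy setup, `agents[0].forward([1.0, 1.0])` reflects the modified adapters, while `agents[1]` still reproduces the pretrained output. In a real training loop only A and B would receive gradient updates.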

Environment Setup

Python Version and Virtual Environment

Python Version: Use Python 3.10, which is compatible with the CUDA 12 builds of PyTorch installed below.

Creating a Conda Environment:

conda create -n meltingpot python=3.10
conda activate meltingpot

Creating a Virtual Environment (alternative to Conda):

python3.10 -m venv meltingpot
source meltingpot/bin/activate
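
After activating either environment, it can be worth confirming that the interpreter is the expected version before installing anything. A minimal check, assuming only the Python 3.10 requirement stated above (the helper name `is_supported_python` is hypothetical):

```python
import sys


def is_supported_python(version_info=sys.version_info, required=(3, 10)):
    """Return True if the interpreter's major.minor matches the required version."""
    return tuple(version_info[:2]) == required


if __name__ == "__main__":
    found = ".".join(str(part) for part in sys.version_info[:3])
    if is_supported_python():
        print(f"Python {found}: OK")
    else:
        print(f"Python {found}: expected 3.10.x, check the active environment")
```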

Installing Dependencies

PyTorch Installation (CUDA 12 Compatible):
Install the CUDA 12.1 toolkit, jsonnet, the Atari dependencies, and a CUDA 12 compatible build of PyTorch. For example:

conda install -c nvidia cuda-toolkit=12.1
conda install conda-forge::jsonnet
pip install "gym[atari,accept-rom-license]"
conda install conda-forge::atari_py
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121

or, to install PyTorch through conda instead of pip, replace the last command with:

conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
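
Once PyTorch is installed, a quick sanity check can confirm that the build sees CUDA. The sketch below relies only on `torch.__version__`, `torch.version.cuda`, and `torch.cuda.is_available()`; the function name `describe_torch` and the layout of the returned dictionary are assumptions for illustration. It returns a placeholder result instead of failing when PyTorch is not installed.

```python
import importlib.util


def describe_torch():
    """Report the installed PyTorch build, or note that it is missing."""
    if importlib.util.find_spec("torch") is None:
        return {"installed": False, "version": None, "cuda_build": None, "cuda_available": False}
    import torch

    return {
        "installed": True,
        "version": torch.__version__,
        # CUDA version the wheel was built against; None for CPU-only builds.
        "cuda_build": torch.version.cuda,
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    for key, value in describe_torch().items():
        print(f"{key}: {value}")
```

If `cuda_available` is False on a machine with a GPU, the usual suspects are a CPU-only wheel or a driver that is too old for the CUDA build.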

Install mpi4py from GitHub:

python -m pip install git+https://github.com/mpi4py/mpi4py

Installing Flash Attention
Flash Attention requires a special installation flag and is not listed in requirements.txt, so install it separately:
```
pip install flash-attn --no-build-isolation
```
Install NumPy with MKL Optimizations:
For enhanced performance in numerical computations:
```
conda install -c conda-forge numpy mkl_fft mkl_random
```
The MKL libraries provide optimized implementations of common math routines, which can speed up CPU-side linear algebra and FFT operations.
Install Required Packages:
Replace the path with your specific requirements file if needed:
```
pip install --no-cache-dir -r requirements.txt
```
Install this Repository in Editable Mode:
```
pip install -e .
```
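
After the steps above, a short script can confirm that the separately installed packages can be located in the active environment. The module names checked here (`torch`, `mpi4py`, `flash_attn`, `numpy`) are the usual import names for the packages installed above; the helper `missing_modules` is hypothetical. Note that `importlib.util.find_spec` only locates a module, it does not import it, so a broken build could still pass this check.

```python
import importlib.util

# Import names for the packages installed in the steps above.
EXPECTED_MODULES = ("torch", "mpi4py", "flash_attn", "numpy")


def missing_modules(names=EXPECTED_MODULES):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All expected modules were found.")
```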

Running the Scripts

Pretraining and Fine-Tuning

Pretraining (e.g., for territory__rooms):

./run_mappo_territory_rooms_pretrain_slot_att_QSA.sh

Fine-Tuning the Slot Attention Representations:

./run_mappo_territory__room_training_slot_attention_and_rim.sh
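
Because fine-tuning starts from the pretrained Slot Attention model, the two scripts are meant to be run in order. One way to chain them and stop at the first failure is sketched below; the `run_in_order` helper and its injectable `runner` argument are assumptions for illustration, and the script names are the two shown above.

```python
import subprocess

# Pretraining first, then fine-tuning, both for territory__rooms.
PIPELINE = (
    "./run_mappo_territory_rooms_pretrain_slot_att_QSA.sh",
    "./run_mappo_territory__room_training_slot_attention_and_rim.sh",
)


def run_in_order(scripts=PIPELINE, runner=subprocess.run):
    """Run each script in turn and return the list of scripts that completed.

    With the default runner, check=True raises CalledProcessError on a
    non-zero exit code, so later stages never start after a failure.
    """
    completed = []
    for script in scripts:
        runner([script], check=True)
        completed.append(script)
    return completed
```

Calling `run_in_order()` from the repository root would run both stages; passing a different `runner` makes it possible to dry-run the sequence without launching any training.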
