This is implementation of PDiT based agents as mentioned in the paper. Here for enhanced decision-making, we utilize Perception Transformers alongside Decision-Making Transformers. By interleaving Perception and Decision Transformers, PDiT optimizes decision-making in complex environments.
PDiT focuses on environments where actions are taken based on image inputs and text prompts. Specifically, we explore scenarios where image-based environments require actions determined by text prompts, which traditionally lack direct correlation between text commands and image data. PDiT aims to improve this interaction by enabling more efficient perception and decision processes.
We have done both offline and online reinforcement learning. For offline learning, we have used the dataset given below for RvS(Reinforcement via supervized learning). And for Unsupervised learning we have used PPO.
We are using the MiniGrid BabyAI dataset for training and evaluation. This dataset provides a simplified, grid-based environment that supports diverse agent tasks, which is ideal for testing perception and decision-making abilities.
Dataset URL: MiniGrid BabyAI
For PDiT using Supervised Learning:
- Set Up the Environment:
- Create a virtual environment for PDiT:
python3 -m venv pd_environment
- Activate the environment:
- For Linux/Mac:
source pd_environment/bin/activate
- For Windows:
pd_environment\Scripts\activate
- For Linux/Mac:
- Install Required Packages:
- Use the
requirements.txt
file to install dependencies:pip install -r requirements.txt
- Use the
3.Download Weights from the source and change paths in the visualiser script. 4. Run the visualiser file
For the PPO / PPO Based PDiT:
- Have the necessary Requirements installed.
Install Required Packages:
- Use the
requirements.txt
file to install dependencies:pip install -r requirements.txt
- Use the
- Download the weights from Source and change paths in the visualiser script.
- Run the visualiser.
The demo links can be found here.
The weights for PPO based training method are here. The weights for RvS based training method are here.