PDiT: Perception and Decision Transformer (Interleaved)

This is implementation of PDiT based agents as mentioned in the paper. Here for enhanced decision-making, we utilize Perception Transformers alongside Decision-Making Transformers. By interleaving Perception and Decision Transformers, PDiT optimizes decision-making in complex environments.

Overview

PDiT focuses on environments where actions are taken based on image inputs and text prompts. Specifically, we explore scenarios where image-based environments require actions determined by text prompts, which traditionally lack direct correlation between text commands and image data. PDiT aims to improve this interaction by enabling more efficient perception and decision processes.

Training

We have done both offline and online reinforcement learning. For offline learning, we have used the dataset given below for RvS(Reinforcement via supervized learning). And for Unsupervised learning we have used PPO.

Dataset

We are using the MiniGrid BabyAI dataset for training and evaluation. This dataset provides a simplified, grid-based environment that supports diverse agent tasks, which is ideal for testing perception and decision-making abilities.

Dataset URL: MiniGrid BabyAI

Visualising The model

For PDiT using Supervised Learning:

Set Up the Environment:

Create a virtual environment for PDiT:
```
python3 -m venv pd_environment
```

Activate the environment:

For Linux/Mac:
```
source pd_environment/bin/activate
```
For Windows:
```
pd_environment\Scripts\activate
```

Install Required Packages:
- Use the requirements.txt file to install dependencies:
```
pip install -r requirements.txt
```

3.Download Weights from the source and change paths in the visualiser script. 4. Run the visualiser file

For the PPO / PPO Based PDiT:

Have the necessary Requirements installed. Install Required Packages:
- Use the requirements.txt file to install dependencies:
```
pip install -r requirements.txt
```
Download the weights from Source and change paths in the visualiser script.
Run the visualiser.

Video Demos

The demo links can be found here.

Weights

The weights for PPO based training method are here. The weights for RvS based training method are here.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
PDIT/RvS		PDIT/RvS
PPO		PPO
PPObasedPDiT		PPObasedPDiT
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDiT: Perception and Decision Transformer (Interleaved)

Overview

Training

Dataset

Visualising The model

Video Demos

Weights

About

Releases

Packages

Contributors 2

Languages

neverthesameagain/PDiT

Folders and files

Latest commit

History

Repository files navigation

PDiT: Perception and Decision Transformer (Interleaved)

Overview

Training

Dataset

Visualising The model

Video Demos

Weights

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages