Model parameters trained with i-DQN
and i-IQN
This repository contains the model parameters trained with i-DQN
on 56 Atari games and trained with i-IQN
on 20 Atari games ๐ฎ. 5 seeds are available for each configuration which makes a total of 380 available models ๐.
The evaluate.ipynb notebook contains a minimal example to evaluate to model parameters ๐งโ๐ซ. It uses JAX ๐. The hyperparameters used during training are reported in config.json ๐ง.
ps: The set of 20 Atari games is included in the set of 56 Atari games.
Model performances
i-DQN and i-IQN are improvements made over DQN and IQN โจ. Check the paper on arXiv! List of games trained with Alien, Amidar, Assault, Asterix, Asteroids, Atlantis, BankHeist, BattleZone, BeamRider, Berzerk, Bowling, Boxing, Breakout, Centipede, ChopperCommand, CrazyClimber, DemonAttack, DoubleDunk, Enduro, FishingDerby, Freeway, Frostbite, Gopher, Gravitar, Hero, IceHockey, Jamesbond, Kangaroo, Krull, KungFuMaster, MontezumaRevenge, MsPacman, NameThisGame, Phoenix, Pitfall, Pong, Pooyan, PrivateEye, Qbert, Riverraid, RoadRunner, Robotank, Seaquest, Skiing, Solaris, SpaceInvaders, StarGunner, Tennis, TimePilot, Tutankham, UpNDown, Venture, VideoPinball, WizardOfWor, YarsRevenge, Zaxxon. |
---|
User installation
Python 3.10 is recommended. Create a Python virtual environment, activate it, update pip and install the package and its dependencies in editable mode:
python3.10 -m venv env
source env/bin/activate
pip install --upgrade pip
pip install numpy==1.23.5 # to avoid numpy==2.XX
pip install -r requirements.txt
pip install --upgrade "jax[cuda12_pip]==0.4.13" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
Citing i-QN
@article{vincent2024iterated,
title={Iterated $ Q $-Network: Beyond the One-Step Bellman Operator},
author={Vincent, Th{\'e}o and Palenicek, Daniel and Belousov, Boris and Peters, Jan and D'Eramo, Carlo},
journal={arXiv preprint arXiv:2403.02107},
year={2024}
}
- Downloads last month
- 6