DeepRL Sub-optimality

This repository contains the code for the paper analyzing the sub-optimality of deep RL algorithms. It is based on the CleanRL codebase.

classic control

    python cleanrl/dqn.py --env-id CartPole-v1
    python cleanrl/ppo.py --env-id CartPole-v1
    python cleanrl/c51.py --env-id CartPole-v1

atari

    poetry install -E atari
    python cleanrl/dqn_atari.py --env-id BreakoutNoFrameskip-v4
    python cleanrl/c51_atari.py --env-id BreakoutNoFrameskip-v4
    python cleanrl/ppo_atari.py --env-id BreakoutNoFrameskip-v4
    python cleanrl/sac_atari.py --env-id BreakoutNoFrameskip-v4
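
To run several of these scripts back to back, a small launcher can help. The following is a minimal sketch, not part of the repository: it assumes only the script paths and the `--env-id` flag shown in the commands above, and `build_commands`, `run_all`, and the `dry_run` flag are hypothetical names introduced for illustration.

```python
import subprocess
import sys

# Script names copied from the commands in this README; nothing else is assumed.
CLASSIC_CONTROL = ["dqn", "ppo", "c51"]
ATARI = ["dqn_atari", "c51_atari", "ppo_atari", "sac_atari"]


def build_commands(scripts, env_id):
    """Return one argv list per script: python cleanrl/<script>.py --env-id <env_id>."""
    return [
        [sys.executable, f"cleanrl/{name}.py", "--env-id", env_id]
        for name in scripts
    ]


def run_all(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False (hypothetical flag)."""
    for argv in commands:
        print(" ".join(argv))
        if not dry_run:
            # check=True stops the sweep at the first failing run.
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    run_all(build_commands(CLASSIC_CONTROL, "CartPole-v1"))
    run_all(build_commands(ATARI, "BreakoutNoFrameskip-v4"))
```

By default the sketch only prints the commands; passing `dry_run=False` to `run_all` runs them sequentially from the repository root. The Atari scripts still require `poetry install -E atari` first.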

Citing Paper

If you use this code in your work, please cite our paper:

@misc{berseth2025explorationoptimizationproblemdeep,
      title={Is Exploration or Optimization the Problem for Deep Reinforcement Learning?}, 
      author={Glen Berseth},
      year={2025},
      eprint={2508.01329},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2508.01329}, 
}
