Implementation of a DQN-based intelligent agent that learns to take optimal discrete control actions. You can train it to play several Atari games and watch the agent's performance.
- Python 3.5+
- PyTorch
- TensorboardX
- OpenAI Gym
- OpenCV (cv2)
- NumPy
- Matplotlib
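These can typically be installed with pip; the package names below are the usual PyPI names rather than ones taken from this repository, and specific versions may be required: `pip install torch tensorboardX "gym[atari]" opencv-python numpy matplotlib`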
- deep_Q_learner.py ──> Main script to launch the Deep Q Learning agent
- environment ──> Module containing environment wrappers and utility functions
  - atari.py ──> Wrappers for preprocessing Atari Gym environments
  - __init__.py
  - utils.py ──> Environment utility functions to resize and reshape observations using OpenCV
- function_approximator ──> Module with neural network implementations
  - cnn.py ──> Three-layer CNN implementation using PyTorch
  - __init__.py
  - perceptron.py ──> Two-layer feed-forward neural network implementation using PyTorch
- logs ──> Folder containing the Tensorboard log files for each run
- parameters.json ──> Configuration parameters for the agent and the environment
- README.md
- trained_models ──> Folder containing trained models ("brains") for the agent
- utils ──> Module containing utility functions used to train the agent
  - decay_schedule.py ──> Decay schedules used by the ε-greedy policy (see the sketch below)
  - experience_memory.py ──> Experience replay memory implementation
  - params_manager.py ──> A simple class to manage the agent's and environment's parameters
  - weights_initializer.py ──> Xavier/Glorot weight initialization method
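To illustrate what the decay schedules in utils/decay_schedule.py are used for, below is a minimal sketch of a linear epsilon-decay schedule for the ε-greedy policy. The class name, constructor arguments, and values are illustrative assumptions and may not match the repository's actual implementation.

```python
# Illustrative linear epsilon-decay schedule (a sketch, not the repository's actual code)
class LinearDecaySchedule:
    """Linearly decays a value (e.g., epsilon) from initial_value to final_value over max_steps."""

    def __init__(self, initial_value, final_value, max_steps):
        assert initial_value > final_value, "initial_value should be greater than final_value"
        self.initial_value = initial_value
        self.final_value = final_value
        self.decay_factor = (initial_value - final_value) / max_steps

    def __call__(self, step_num):
        # Decrease linearly with the step number; clamp at final_value once max_steps is exceeded
        current_value = self.initial_value - self.decay_factor * step_num
        return max(current_value, self.final_value)


# Example: epsilon goes from 1.0 down to 0.05 over the first 100000 steps, then stays at 0.05
epsilon = LinearDecaySchedule(initial_value=1.0, final_value=0.05, max_steps=100000)
print(epsilon(0), epsilon(50000), epsilon(500000))  # 1.0 0.525 0.05
```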
deep_Q_learner.py is the main script; it takes care of both training and testing, depending on the arguments it is launched with. The table below summarizes the arguments the script supports and what they mean. Note that most of the agent- and environment-related configuration parameters live in the parameters.json file; only the few parameters that are most useful when launching training/testing runs are exposed as command line arguments. An example invocation follows the table.
| Argument | Description |
|---|---|
| `--params-file` | Path to the JSON parameters file. Default=parameters.json |
| `--env` | Name/ID of the Atari environment available in OpenAI Gym. Default=SeaquestNoFrameskip-v4 |
| `--gpu-id` | ID of the GPU device to be used. Default=0 |
| `--render` | Render the environment to screen. Off by default |
| `--test` | Run the script in test mode. Learning will be disabled. Off by default |
| `--record` | Enable recording (video & stats) of the agent's performance |
| `--recording-output-dir` | Directory to store the recordings (video & stats). Default=./trained_models/results |
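For example, to point the script at a different configuration file while keeping the other defaults, the `--params-file` flag can be combined with the other flags documented above (the file name here is just a placeholder): `python deep_Q_learner.py --params-file my_parameters.json --env SeaquestNoFrameskip-v4 --gpu-id 0`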
You can launch the agent training script from the `~/DQN-for-Atari-games-master` directory using the following command: `python deep_Q_learner.py --env RiverraidNoFrameskip-v4 --gpu-id 0`

The above command starts training the agent on the Riverraid Atari game (`RiverraidNoFrameskip-v4`). If a saved agent "brain" (trained model) is available for the chosen environment, the training script will load that brain into the agent and continue training it to improve further. The training runs until `max_training_steps` is reached, which is specified in the parameters.json file. Several other parameters can be configured through parameters.json as well, and it is recommended to adjust them based on the capabilities of the hardware you are running on. For example, you can set `use_cuda` to `false` if you are running on a machine without a GPU. The log files are written to the directory pointed to by the `summary_file_path_prefix` parameter (the default is `logs/DQL_*`). While the training script is running, you can monitor the agent's learning progress visually using Tensorboard. From the `~/DQN-for-Atari-games-master` directory, launch Tensorboard with the following command: `tensorboard --logdir=./logs/`. You can then visit the web URL printed on the console (the default is http://localhost:6006) to monitor the progress.
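For reference, the configuration keys mentioned above live in parameters.json. The excerpt below is only an illustrative sketch: the key names max_training_steps, use_cuda, and summary_file_path_prefix come from this README, but the grouping and values shown here are assumptions and will not exactly match the file shipped with the repository.

```json
{
  "agent": {
    "max_training_steps": 2000000,
    "use_cuda": true,
    "summary_file_path_prefix": "logs/DQL_"
  }
}
```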
To run a trained agent in test mode (with learning disabled), use: `python deep_Q_learner.py --env RiverraidNoFrameskip-v4 --test --render --record`

The above command launches the Deep Q Learning agent in test mode, renders the environment states to the screen, and records the agent's performance. You can find the stats and the recordings in the trained_models/results directory after the script finishes running. Sample output for the agent trained on `RiverraidNoFrameskip-v4` for a few thousand episodes is shown below: