Skip to content

jcarvajal91/doom-net-pytorch

 
 

Repository files navigation

DoomNet

DoomNet is a ViZDoom agent trained by reinforcement learning. The agent is a neural network that outputs a probability of actions given only pixels from the screen buffer and set of game variables.

Models:

  • aac.py is an Advantage Actor Critic model. DoomNet's training runs multiple instances of the game in parallel and performs both forward pass and parameter updates on GPU in the main thread. It is faster than A3C on complex models and if number of parallel episodes, batch size, greatly exceeds number of CPUs.
  • aac_lstm.py is the same as previous model but uses LSTM in place of fully connected layers following CNN.
  • imitation.py is a model that learns to copy behavior of a human player. The trained weights are used to initialize acc.py or aac_lstm.py.

Trained models

Joint work with Bobby DeSimone

DoomNet's view is at left in the middle row
Doom-net track1, elimination round 2017

D3 Battle

Doom-net trained on D3-battle config

Health Gathering

Doom-net trained on health gathering config

Rocket Basic

Doom-net trained on basic rocket config

Basic

Doom-net trained on basic config

About

Reinforcement learning models in ViZDoom environment

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 75.6%
  • Shell 24.4%