DoomNet is a ViZDoom agent trained by reinforcement learning. The agent is a neural network that outputs a probability of actions given only pixels from the screen buffer and set of game variables.
Models:
- aac.py is an Advantage Actor Critic model. DoomNet's training runs multiple instances of the game in parallel and performs both forward pass and parameter updates on GPU in the main thread. It is faster than A3C on complex models and if number of parallel episodes, batch size, greatly exceeds number of CPUs.
- aac_lstm.py is the same as previous model but uses LSTM in place of fully connected layers following CNN.
- imitation.py is a model that learns to copy behavior of a human player. The trained weights are used to initialize acc.py or aac_lstm.py.
Joint work with Bobby DeSimone