Skip to content

Latest commit

 

History

History
36 lines (21 loc) · 1.12 KB

README.md

File metadata and controls

36 lines (21 loc) · 1.12 KB

Reinforcement-learning

This is a PyTorch implementation of Deep Q-Learning.

Example (1)- A result from the Atari_Breakout.

Experience memory capacity: 80000 set
Random action ratio: from 1.0 to 0.1 through 1000000 frames

Best eposide:

54_8520.mp4

Learning scores:

Blue line: Score of training episode.
Magenta line: Mean score of last 100 training episodes.
Red star: Score of target agent.

Example (2)- A result from the Cartpole.

Experience memory capacity: 10000 set
Random action ratio: from 0.9 to 0.05 through 200 episodes

Best eposide:

221_172.mp4

Learning scores:

Blue line: Score of training episode.
Magenta line: Mean score of last 100 training episodes.