
TF2 Implementation of Proximal Policy Optimization


shakti365/ppo

Proximal Policy Optimization

Implementation of the PPO algorithm in TensorFlow 2 (TF2)

Notes: https://shivamshakti.dev/posts/ppo

Usage

  • Create a virtual environment for Python (I use this setup)

  • Install the dependencies

    pip install -r requirements.txt
    
  • Run the training script

    cd src
    python main.py # Uses `MountainCarContinuous-v0` by default
    
  • Run the evaluation script

    python play.py --model_name <PATH_TO_SAVED_MODEL>
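
The first step above does not say how the virtual environment is created. A minimal sketch, assuming a POSIX shell and a `python3` that ships the standard-library `venv` module, is shown below. The directory name `.venv` is an arbitrary choice for illustration, and this is not necessarily the setup the author uses.

```shell
# Assumption: python3 with the standard-library venv module is on PATH.
# ".venv" is an arbitrary directory name chosen for this sketch.
python3 -m venv .venv

# Activate the environment for the current shell session (POSIX shells).
. .venv/bin/activate

# Show which python the shell now resolves; it should live under .venv.
command -v python
```

With the environment active, the `pip install -r requirements.txt` step installs the dependencies into `.venv` rather than the system Python.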
    
