Releases: cpnota/autonomous-learning-library
VPG
The release contains two small changes:
- Renamed REINFORCE to VPG to stay consistent with other libraries. VPG can now also average gradients over multiple episodes, which drastically improves performance in some cases.
- Tweaked A2C to align better with other implementations. In particular, added a new n-step buffer that is more accurate. There are also some small changes to ensure feature gradients are computed correctly.
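
For readers unfamiliar with the term, the quantity an n-step buffer typically produces is the n-step bootstrapped return. The following is a minimal sketch of that calculation in plain Python, not the library's actual buffer: the function name `n_step_returns` and all of its parameters are hypothetical and chosen only for illustration.

```python
def n_step_returns(rewards, values, dones, bootstrap_value, n, gamma=0.99):
    """Compute n-step bootstrapped returns for one trajectory.

    Hypothetical helper for illustration; not part of the library's API.
    rewards[t], dones[t]: reward and terminal flag of the transition at step t.
    values[t]: critic estimate V(s_t) for the state at step t.
    bootstrap_value: critic estimate for the state after the last transition.
    """
    T = len(rewards)
    # values_ext[t] holds V(s_t) for t = 0..T, so index T is the bootstrap state.
    values_ext = list(values) + [bootstrap_value]
    returns = []
    for t in range(T):
        g = 0.0
        discount = 1.0
        terminated = False
        end = min(t + n, T)  # look ahead at most n steps, clipped to the trajectory
        for k in range(t, end):
            g += discount * rewards[k]
            discount *= gamma
            if dones[k]:
                # Episode ended inside the window: do not bootstrap past it.
                terminated = True
                break
        if not terminated:
            g += discount * values_ext[end]
        returns.append(g)
    return returns


# Usage: three transitions, the last one terminal, with a 2-step lookahead.
example = n_step_returns(
    rewards=[1.0, 1.0, 1.0],
    values=[0.5, 0.5, 0.5],
    dones=[False, False, True],
    bootstrap_value=10.0,
    n=2,
    gamma=0.5,
)
```

In this sketch, rewards are summed for up to `n` steps and the critic's estimate is added only when no terminal transition falls inside the window, which is one common source of inaccuracy in simpler buffers.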