Skip to content

Releases: cpnota/autonomous-learning-library

VPG

31 May 21:47
021f0a0
Compare
Choose a tag to compare
VPG Pre-release
Pre-release

The release contains two small changes:

  1. Rename REINFORCE to VPG in order to stay consistent with other libraries. Also, allow VPG to average the gradients over multiple episodes, drastically improving performance in some cases.
  2. Tweaked A2C to make it align better with other implementations. In particular, a new n-step buffer was added that is more accurate. There are also some small changes to make sure feature gradients are computed correctly.

A2C

14 Apr 18:44
Compare
Choose a tag to compare
A2C Pre-release
Pre-release
  • Add the a2c algorithm
  • Support for running parallel environments
  • Record the loss function and other information
  • Simplify the installation process
  • Reenable TravisCI tests

CUDA

06 Apr 22:11
efb3f5d
Compare
Choose a tag to compare
CUDA Pre-release
Pre-release
  1. Add CUDA support
  2. Refactor list handling in function approximators
  3. Tweak the organization of presets
  4. Improve the benchmarks API

DQN

22 Mar 23:09
589ee9e
Compare
Choose a tag to compare
DQN Pre-release
Pre-release

Includes a fully working DQN implementation, as well as a working partial Rainbow. Also includes Actor-Critic, Sarsa, and Reinforce implementations for classic control environments.