GitHub - snnclsr/rl: Repository about reinforcement learning.

In this repository, I will share my knowledge of and work on reinforcement learning. I will write blog posts and implement the algorithms in order to understand them well. I will also share some articles that help explain the concepts.

Algorithms

Vanilla Policy Gradients (REINFORCE)

Check out the blog post for a detailed explanation.

Summary: Policy gradient algorithms learn (optimize) the policy directly. We generate sample trajectories from the environment. For each trajectory, we compute the sum of the gradients of the log-probabilities of the actions taken, and also the total reward. We multiply the two, average over the trajectories, and update the policy parameters with gradient ascent.
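The steps in the summary can be sketched in a few lines of NumPy. This is only a minimal illustration, not the code in vpg.py: the linear-softmax policy, the function names (`grad_log_prob`, `reinforce_gradient`, `gradient_ascent_step`), and the trajectory format (a tuple of observations, actions, rewards) are all assumptions made for the example.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    z = logits - logits.max()
    e = np.exp(z)
    return e / e.sum()


def grad_log_prob(theta, obs, action):
    """Gradient of log pi(action | obs) w.r.t. theta.

    Assumes a linear-softmax policy (hypothetical, for illustration):
    logits = obs @ theta, with theta of shape (obs_dim, n_actions).
    """
    probs = softmax(obs @ theta)
    one_hot = np.zeros_like(probs)
    one_hot[action] = 1.0
    return np.outer(obs, one_hot - probs)


def reinforce_gradient(theta, trajectories):
    """REINFORCE gradient estimate, averaged over trajectories.

    Each trajectory is an (observations, actions, rewards) tuple.
    """
    grad = np.zeros_like(theta)
    for observations, actions, rewards in trajectories:
        # Total reward collected along this trajectory.
        total_reward = float(np.sum(rewards))
        # Sum of the log-probability gradients along the trajectory.
        sum_grad = np.zeros_like(theta)
        for obs, action in zip(observations, actions):
            sum_grad += grad_log_prob(theta, obs, action)
        # Multiply the two, as described in the summary.
        grad += sum_grad * total_reward
    return grad / len(trajectories)


def gradient_ascent_step(theta, trajectories, lr=0.1):
    """One gradient ascent update of the policy parameters."""
    return theta + lr * reinforce_gradient(theta, trajectories)
```

With a single one-step trajectory that took action 0 and received reward 1, one update raises the probability of action 0 for that observation. Practical implementations usually reduce the variance of this estimate, for example with reward-to-go or a baseline, which this sketch leaves out.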

Code

The skeleton code for the implementation is taken from Assignment 2 of the Berkeley RL course, which can be found here. Also check out the course content here.

Files

DQN.ipynb
README.md
logz.py
vpg.py
