Skip to content

Ongoing implementation of α-Tsallis regularized MDP

Notifications You must be signed in to change notification settings

AthomsG/TAC_Thesis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

72 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement Learning with discrete TAC

Repository for Discrete Tsallis Actor-Critic (Discrete TAC) model implementation for solving reinforcement learning tasks. This model is built using PyTorch and is designed to effectively learn policies that maximize α-Tsallis regularized expected returns in various discrete environments.

About

Ongoing implementation of α-Tsallis regularized MDP

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published