Reinforcement learning2 gabriel #22

gabriel-trigo · 2025-03-27T14:47:55Z

Fixes #20 (along with other changes)

This PR should be merged AFTER #18 and #19, as it builds on top of them. Consequently, it contains the commits from those PRs as well as new commits on top.

The new commits add:

eval.py script to evaluate policies
generate_gin_config_files.py script to generate variations of gin environment config files
Visualization module with features to save plots of evaluation runs to eval results
Implementations of td3 and ddpg agents
Other quality-of-life changes (see commit descriptions)

Tests pass
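As a rough illustration of what generating config variations can look like, here is a minimal sketch that rewrites `name = value` lines in a base config for every combination in a parameter grid. It is not the code in generate_gin_config_files.py. The binding names `start_timestamp` and `num_days_in_episode`, the sample values, and both helper functions are hypothetical; only `time_step_sec` is a name that appears in this PR.

```python
import itertools
import re

# Hypothetical base config text; the real gin files in the repo will differ.
BASE = """\
time_step_sec = 300
start_timestamp = '2023-07-06 07:00:00+00:00'
num_days_in_episode = 14
"""


def set_binding(config_text, name, value):
    """Replace the right-hand side of a `name = value` line."""
    pattern = re.compile(
        rf"^([ \t]*{re.escape(name)}[ \t]*=[ \t]*).*$", re.MULTILINE
    )
    # A function replacement avoids backslash-escape surprises in `value`.
    new_text, count = pattern.subn(lambda m: m.group(1) + str(value), config_text)
    if count == 0:
        raise KeyError(f"binding {name!r} not found in base config")
    return new_text


def generate_variations(base_text, grid):
    """Yield (chosen_values, config_text) for every combination in `grid`."""
    names = sorted(grid)
    for values in itertools.product(*(grid[n] for n in names)):
        text = base_text
        for name, value in zip(names, values):
            text = set_binding(text, name, value)
        yield dict(zip(names, values)), text


if __name__ == "__main__":
    grid = {"time_step_sec": [300, 600], "num_days_in_episode": [1, 7]}
    for chosen, text in generate_variations(BASE, grid):
        print(chosen)
        print(text)
```

Each yielded text could then be written to its own file, named after the chosen values, so that one training or evaluation run maps to one config.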

gabriel-trigo added 16 commits March 9, 2025 14:30

feat: add reinforcement learning directory. The purpose of this directory is to have all reinforcement learning related code, and scripts to train and evaluate agents

bb23237

Merge branch 'reinforcement_learning-gabriel'

f8ac4de

chore: update .gitignore to ignore experiment results, replay buffer data

72241eb

build: add tqdm to project dependencies. Will be used to better monitor progress of RL experiments

7ec2f9d

feat: add generate_gin_config_files.py script, which takes in a base gin config file, and generates variations changing important parameters like time step length, start date and number of days on episode. Also added .bash script to illustrate how to use it

989a423

feat: add ddpg agent implementation to agents directory

5c619d2

feat: add td3 implementation to agents directory

e55bd0a

feat: improve train.py script. Added support for td3 and ddpg implementations that were added in previous commits. Changed some default argument values, same for populate_starter_buffer.py script

75232f6

feat: add observer that records and saves trajectories. Also added a visualization module, which this observer uses to plot the graphs

812565f

fix: minor bug in print_status_observer.py. Was using the total number of steps instead of episode steps to calculate percentage, leading to > 100% values

5694dcb
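To make the effect of this fix concrete, here is a small worked example of the arithmetic. It is a sketch, not the observer's actual code: the function name and the episode length of 288 steps are assumptions chosen for illustration.

```python
def percent_complete(steps_done, steps_per_episode):
    """Return progress as a percentage of one episode."""
    return 100.0 * steps_done / steps_per_episode


# Assumed episode length for illustration: 288 steps.
steps_per_episode = 288

# Halfway through the second episode: 288 steps from the first episode
# plus 144 steps into the current one.
total_steps = 432
episode_steps = total_steps - steps_per_episode  # 144

# Using the running total overshoots once the first episode is over.
buggy = percent_complete(total_steps, steps_per_episode)    # 150.0

# Using the steps taken within the current episode stays within 0..100.
fixed = percent_complete(episode_steps, steps_per_episode)  # 50.0

print(buggy, fixed)
```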

feat: add eval.py script, which is used to evaluate a trained policy. Added saved_model_policy.py file to policies directory -- this file has a class which is used to load and interact with policies saved during training

6a92d8d

docs: add example bash script to run the populate_starter_buffer script

c0b5679

chore: update .gitignore

7f8d416

fix: get rid of step_interval parameter in environment.py (this parameter was redundant with the base_building's time_step_sec)

111b6e2

fix: make eval.py script use the latest learned policy checkpoint (before, it was wrongly using the first checkpoint, which gave the untrained agent performance)

4e0dd09
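One way to pick the most recent checkpoint is to parse the training step out of each checkpoint directory name and take the maximum. The sketch below assumes a hypothetical layout in which each checkpoint is a directory whose name ends in its step number (for example `policy_checkpoint_10000`). The actual naming used by eval.py is not shown in this PR, and `latest_checkpoint` is an illustrative helper, not a function from the repository.

```python
import re
from pathlib import Path

# Hypothetical convention: the directory name ends with the training step.
_STEP_SUFFIX = re.compile(r"(\d+)$")


def latest_checkpoint(checkpoint_root):
    """Return the checkpoint directory with the highest step number."""
    candidates = []
    for path in Path(checkpoint_root).iterdir():
        match = _STEP_SUFFIX.search(path.name)
        if path.is_dir() and match:
            candidates.append((int(match.group(1)), path))
    if not candidates:
        raise FileNotFoundError(f"no checkpoints found under {checkpoint_root}")
    # Compare steps as integers: a plain string sort would rank
    # "..._900" above "..._10000" when names are not zero-padded.
    return max(candidates, key=lambda item: item[0])[1]
```

Taking the first entry of a directory listing, or the lowest step, would return the policy as it was before training, which matches the untrained-agent symptom described in the commit message.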

tests: fix environment.py tests that were failing to conform to the changes from 111b6e2

c15c274

s2t2 reviewed Apr 8, 2025

smart_control/reinforcement_learning/utils/config.py (resolved)

s2t2 reviewed Apr 8, 2025

smart_control/reinforcement_learning/utils/config.py (resolved)

s2t2 force-pushed the copybara_push branch from cfad6e2 to da325d2 on May 16, 2025 16:27
