changes for latest versions of BenchMARL #3

karthiks1701 · 2025-05-01T21:54:48Z

A few adaptations so that DiCo works with the latest BenchMARL codebase. With these changes, there is no need to install specific branches of tensordict, TorchRL, and BenchMARL. Let me know if additional testing is required.

SND curves for the sampling environment

Final learned policy
https://github.com/user-attachments/assets/4ba5ae75-02d2-4d80-868b-03bd538af34a

karthiks1701 · 2025-05-01T21:58:49Z

het_control/models/het_control_mlp_empirical.py

@@ -143,7 +149,9 @@ def _forward(
        else:  # Gather outputs for one agent on the obs
            # tensor of shape [*batch, n_agents, n_actions], where the outputs
            # along the n_agent dimension are taken with the same (agent_index) agent network
-            agent_out = self.agent_mlps.agent_networks[agent_index].forward(input)
+            # agent_out = self.agent_mlps.agent_networks[agent_index].forward(input)


This is the key change, as agent_networks is no longer supported in TorchRL.

matteobettini

Thanks a mil, just a few qs

matteobettini · 2025-05-02T11:40:50Z

het_control/conf/sampling_iddpg_config.yaml

@@ -13,7 +13,7 @@ use_action_loss: True
 action_loss_lr: 0.00003

 experiment:
-  max_n_frames: 5_000_000
+  max_n_frames: 1_000_000


I would not change the default config for reproducibility

Sorry about that. I didn't want to run for longer, so I pushed this by mistake.

matteobettini · 2025-05-02T11:42:23Z

het_control/models/het_control_mlp_empirical.py

+            distance = self.estimate_snd(input)
+            if update_estimate:
+                self.estimated_snd[:] = distance.detach()


Could you explain this a bit?

If those conditions are met, we can avoid computing $\widehat{\mathrm{SND}}$

I did this to be able to log estimated_snd during training when the desired SND is -1. Right now it logs NaNs. It was just to be able to see the evolution of SND during training as well. It can be removed if necessary.

I see, but you can still see it under eval/snd, no?

Yes, but if I understand correctly, that is only during evaluation, right? This was helpful for understanding how the SND evolves during training. But you are right, eval/snd is enough. Should we roll back to the previous version?

OK, got it. I'll take care of things, don't worry.
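For readers following this thread, here is a minimal sketch of the logging behaviour discussed above: the SND estimate is always computed, and it is stored whenever `update_estimate` is set, so the logged value is a number rather than NaN. This is plain Python, not the repository's code. It assumes SND is the mean pairwise Euclidean distance between agents' actions on the same observations. `estimate_snd_sketch` and `SndTracker` are hypothetical names, and the real `estimate_snd` in het_control_mlp_empirical.py may compute the distance differently.

```python
import math
from itertools import combinations


def estimate_snd_sketch(actions):
    """Mean pairwise distance between agents' actions (assumed SND definition).

    actions[b][i] is the action vector of agent i on observation b.
    """
    n_agents = len(actions[0])
    if n_agents < 2:
        return 0.0  # a single agent has no diversity to measure
    pair_means = []
    for i, j in combinations(range(n_agents), 2):
        # Distance between agents i and j, averaged over all observations
        dists = [math.dist(per_obs[i], per_obs[j]) for per_obs in actions]
        pair_means.append(sum(dists) / len(dists))
    # Average over all agent pairs
    return sum(pair_means) / len(pair_means)


class SndTracker:
    """Hypothetical stand-in for the model's estimated_snd buffer."""

    def __init__(self):
        # NaN until the first update, which is what shows up in the logs
        # if the estimate is never computed
        self.estimated_snd = float("nan")

    def update(self, actions, update_estimate=True):
        distance = estimate_snd_sketch(actions)
        if update_estimate:
            self.estimated_snd = distance
        return distance
```

Calling `SndTracker().update(actions)` on every training forward pass keeps `estimated_snd` populated. With `update_estimate=False`, the distance is still returned but the stored value is left unchanged.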

changes for new version of BenchMaRL

9496cba

karthiks1701 changed the title ~~changes for new version of BenchMaRL~~ changes for latest versions of BenchMaRL May 1, 2025

karthiks1701 changed the title ~~changes for latest versions of BenchMaRL~~ changes for latest versions of BenchMARL May 1, 2025

matteobettini approved these changes May 2, 2025

restore params of sampling_iddpg_config.yaml

c67511e
