What's with the outlier highly active component?

It's interesting that in our "solved" resid_mlp1 and resid_mlp2 toy models, one component has a higher ci_mean_per_component_log than the other active ones.
- [resid_mlp1](https://wandb.ai/goodfire/spd/runs/uz5swum7) solved run
- [resid_mlp2](https://wandb.ai/goodfire/spd/runs/grevt2h2) solved run. Note, there's a little bit of noise in layers.1.mlp_in, but I think that's unrelated to this more-active component. You can actually see the two stragglers appearing in the ci_mean_per_component_log plot.

This does not occur in TMS. What do these components do? Do they always form when training with various hyperparameters?

If anyone is looking for ways to contribute, this would be a nice thing to investigate. I think you can even train the resid_mlp1 on a cpu pretty quickly, though I haven't tried in a while.

<img width="1580" height="1180" alt="Image" src="https://github.com/user-attachments/assets/e989c7ef-7c0a-4764-9bdc-4a008ff9f8f4" />
<img width="1531" height="3034" alt="Image" src="https://github.com/user-attachments/assets/db6cdb6c-5a30-4ddb-a61f-91ad4ee26efc" />
<img width="1580" height="2380" alt="Image" src="https://github.com/user-attachments/assets/64dc6a11-3198-41dc-94e4-bb05a2ad9964" />
<img width="1527" height="6034" alt="Image" src="https://github.com/user-attachments/assets/71449e19-9a1a-4c6b-a5c1-7bbbd5c9b74b" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's with the outlier highly active component? #242

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

What's with the outlier highly active component? #242

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions