Makes ComponentModel generic over batch and output types#363
Makes ComponentModel generic over batch and output types#363danbraunai-goodfire wants to merge 20 commits intomainfrom
Conversation
- Add `BatchT` and `OutputT` type parameters to ComponentModel and related functions - Replace `pretrained_model_output_attr` with `run_batch` and `reconstruction_loss` callables - Remove `extract_batch_data` and `AliveComponentsTracker` utilities -
This reverts commit e712573.
|
@danbraunai-goodfire I think I definitely went overboard with the abstractions so thanks for reigning this is lol. I do have some suggestions in #375. lmk what you think |
|
@claude I'd like you to do a thorough review of this PR. It's big. I'm worried that there are inconsistencies or that we've made some bad architectural choices. I want critical comments, do feel comfortable to give me the hard truths. |
|
Claude finished @danbraunai-goodfire's task —— View job PR Review: ComponentModel Generic Types
|
|
@claude I've removed OutputT as I noticed that it was always a Tensor. Could you please review this PR now? Again, I'm interested in critical comments. |
|
Claude finished @danbraunai-goodfire's task —— View job PR Review: ComponentModel Generic Types
|
|
@danbraunai-goodfire the reason for abstracting outputT was to enable the usecase where a model outputs a dict and has a more advanced recon loss based on those dict elements |

Description
TODO (either here or afterwards):
Related Issue
Motivation and Context
How Has This Been Tested?
Does this PR introduce a breaking change?