
(dev -> main) App updates and backwards transformer ci functions #355

Draft
danbraunai-goodfire wants to merge 519 commits into main from dev

Conversation

@danbraunai-goodfire
Collaborator

Description

Related Issue

Motivation and Context

How Has This Been Tested?

Does this PR introduce a breaking change?

claude-spd1 and others added 22 commits February 2, 2026 12:43
- Rename all classes/functions from ContinuousPGD to PersistentPGD
- Rename config files from *-cpgd* to *-ppgd*
- Simplify mask shape from (batch, C) to (C,) per module
- Single mask shared across all batch elements and ranks
- Use fixed seed (PPGD_INIT_SEED=42) for consistent initialization
- Keep all_reduce on gradients so updates stay synchronized

Co-Authored-By: Claude Opus 4.5 <[email protected]>
This feature tracked which components were considered "alive" based on
their firing frequency over time, but wasn't providing value. Old configs
with n_examples_until_dead will still load (the key is now ignored).

Co-Authored-By: Claude Opus 4.5 <[email protected]>
Add a subset variant of persistent PGD loss that routes to random
subsets of modules per position, analogous to how StochasticReconSubsetLoss
relates to StochasticReconLoss.

- Add PersistentPGDReconSubsetLossConfig with routing field
- Make router a required parameter in persistent_pgd_recon_loss
- Update run_spd.py to pass appropriate router based on config type
- Update tests to pass router parameter

Co-Authored-By: Claude Opus 4.5 <[email protected]>
…ments

- Add transition_hidden_dim config for global_reverse_residual CI function
- Move CI reading after transition for improved architecture
- Add pile_gpt2_simple-2L_global_reverse experiment
- Update ss_llama_simple_mlp-2L-wide expected runtime

Co-Authored-By: Claude Opus 4.5 <[email protected]>
- Add PersistentPGDMaskScope type with single_mask, broadcast_across_batch,
  unique_per_batch_per_token options
- Change PersistentPGDState to take batch_dims and cfg instead of optimizer_cfg
- Add get_grads() method to PersistentPGDState for computing gradients
- Refactor compute_losses to return dict of losses instead of total
- Update persistent_pgd_recon_loss to be called from compute_losses
- Update tests to use new API (skip 2 tests pending LM-style test model)

Co-Authored-By: Claude Opus 4.5 <[email protected]>
@danbraunai-goodfire
Collaborator Author

@claude can you review this PR? Note that your review should cover the scope of this PR only; if you spot things unrelated to this PR, feel free to bring them up and we'll consider them for new issues. I want you to think about possible ways the implementation could be improved, including ways it could be rewritten completely using different abstractions. I want critical comments; feel free to give me the hard truths.

@claude
Contributor

claude bot commented Feb 4, 2026

Claude finished @danbraunai-goodfire's task


PR Review: App updates and backwards transformer ci functions

Review Scope: 288 commits, 100 files changed, focusing on app updates and transformer CI functions

Todo List

  • Activate virtual environment and setup
  • Analyze major architectural changes in the app
  • Review new abstractions and components
  • Examine backend API changes
  • Review frontend component refactoring
  • Analyze config changes across experiments
  • Provide critical feedback on implementation approach
  • Suggest alternative architectures and improvements

Critical Review & Hard Truths

🔥 Major Architectural Problems

1. Backend Compute Layer is a God Object (spd/app/backend/compute.py:1-827)

  • Problem: 827 lines doing gradient computation, edge calculation, intervention logic, AND result formatting
  • Anti-pattern: Multiple responsibilities violating SRP
  • Rewrite suggestion:
    # Split into focused modules:
    spd/app/backend/compute/
      ├── gradient_engine.py     # Core autograd operations
      ├── attribution_builder.py # Edge construction logic  
      ├── intervention_runner.py # Forward passes with masks
      └── result_formatters.py   # Output transformation

2. Database Schema Fighting the Type System (spd/app/backend/database.py)

  • Problem: StoredGraph uses discriminated unions in Python, but database still has nullable columns for all possible fields
  • Result: Invalid states are representable at DB level despite type safety in Python
  • Solution: Either use proper table inheritance OR embrace the union and validate constraints in application code, don't half-ass both approaches
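One way to make invalid states unrepresentable at the DB level is a table-level CHECK constraint tying the discriminator to its fields. A minimal sqlite3 sketch, with hypothetical table and column names (not the real StoredGraph schema):

```python
import sqlite3

# Hypothetical schema sketch: enforce the discriminated union in the DB
# itself, so a row can't mix fields from different graph kinds.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE stored_graph (
        id INTEGER PRIMARY KEY,
        kind TEXT NOT NULL CHECK (kind IN ('display', 'optimized')),
        optim_params TEXT,  -- JSON, only meaningful for optimized graphs
        CHECK (
            (kind = 'display'   AND optim_params IS NULL) OR
            (kind = 'optimized' AND optim_params IS NOT NULL)
        )
    )
""")
conn.execute(
    "INSERT INTO stored_graph (kind, optim_params) VALUES ('optimized', '{}')"
)
try:
    # Invalid state: a display graph carrying optimization params.
    conn.execute(
        "INSERT INTO stored_graph (kind, optim_params) VALUES ('display', '{}')"
    )
    violated = False
except sqlite3.IntegrityError:
    violated = True
print(violated)  # the CHECK constraint rejects the invalid row
```

This keeps the nullable columns but makes the DB reject the combinations the Python union already forbids.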

3. Frontend State Management is Chaotic (PromptAttributionsTab.svelte:565+ lines)

  • Problem: No state machine - implicit transitions everywhere
  • Examples:
    • TabViewState + GraphComputeState + ComposerState can be in invalid combinations
    • UI can be in "loading" state while also in "draft" mode
  • Rewrite suggestion: Use XState or explicit state machine pattern. Current approach doesn't scale.

🚨 Code Quality Issues

4. Layer Aliasing Global State (spd/app/frontend/src/lib/layerAliasing.ts:57)

let cachedArchitecture: Architecture | null = null;
  • Problem: Global mutable cache that can become stale, not reset between contexts
  • Solution: Pass architecture context down explicitly or use a proper store

5. Bulk API Endpoint Design (spd/app/backend/routers/component_data.py)

  • Good: Eliminated GIL contention by batching requests
  • Bad: Still sequential execution inside the handler - missed opportunity for true parallelism
  • Alternative: Use asyncio.gather() to run the three data fetches concurrently
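A sketch of that concurrency fix, assuming three independent async fetchers (the function names here are hypothetical stand-ins for the handler's real data fetches):

```python
import asyncio

# Hypothetical fetchers standing in for the three data fetches.
async def fetch_activations(idx: int) -> str:
    await asyncio.sleep(0.01)
    return f"acts-{idx}"

async def fetch_attributions(idx: int) -> str:
    await asyncio.sleep(0.01)
    return f"attrs-{idx}"

async def fetch_metadata(idx: int) -> str:
    await asyncio.sleep(0.01)
    return f"meta-{idx}"

async def handler(idx: int) -> dict:
    # Run the three fetches concurrently instead of one after another.
    acts, attrs, meta = await asyncio.gather(
        fetch_activations(idx), fetch_attributions(idx), fetch_metadata(idx)
    )
    return {"acts": acts, "attrs": attrs, "meta": meta}

result = asyncio.run(handler(7))
print(result["acts"])  # acts-7
```

Note this only helps if the fetches are genuinely I/O-bound or release the GIL; CPU-bound work would need a thread or process pool instead.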

6. Configuration Explosion (spd/configs.py)

  • Added complex global_reverse_residual CI config with 7+ required fields and complex validation
  • Problem: Config complexity is growing faster than the value it provides
  • Question: Why not use dataclasses with defaults instead of elaborate Pydantic validation?

💡 Missed Opportunities

7. TypeScript Discriminated Unions Done Halfway

// Current: Can still represent invalid states
type OptimizeConfigDraft = {
    loss: LossConfigDraft;  // Could be invalid
    // ...
}

// Better: Make invalid states unrepresentable  
type OptimizeState = 
  | { status: "draft"; config: Partial<OptimizeConfig> }
  | { status: "valid"; config: ValidOptimizeConfig }
  | { status: "computing"; config: ValidOptimizeConfig }

8. Database Caching Strategy

  • Current: Ad-hoc SQLite caching with complex uniqueness constraints
  • Alternative: Consider Redis/in-memory cache for computed graphs + persistent storage for configuration only
  • Benefit: Simpler schema, better performance for large graphs

🎯 Specific Technical Issues

9. Performance Anti-patterns:

  • compute_prompt_attributions() does full gradient computation even for display-only graphs
  • useComponentData and useComponentDataExpectCached are nearly identical - code duplication
  • Frontend doesn't virtualize large lists (could be 1000+ components)

10. Error Handling Inconsistency:

  • Some places use assert for validation (good, fail-fast)
  • Others use try/catch with graceful degradation (bad for research code)
  • Pick one approach: Research code should fail fast, not silently degrade
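The fail-fast style being recommended can be sketched as follows; the function and checks are illustrative, not taken from the codebase:

```python
# Hypothetical example of fail-fast validation for research code:
# assert preconditions and let bad inputs crash loudly, rather than
# catching exceptions and silently degrading.
def compute_edge_weights(acts: list[float], grads: list[float]) -> list[float]:
    assert len(acts) == len(grads), f"shape mismatch: {len(acts)} vs {len(grads)}"
    assert all(g == g for g in grads), "NaN gradient"  # NaN != NaN
    return [a * g for a, g in zip(acts, grads)]

print(compute_edge_weights([1.0, 2.0], [0.5, 0.25]))  # [0.5, 0.5]
```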

🔨 Complete Rewrite Suggestions

Alternative 1: Event-Driven Architecture
Replace current request/response with events:

# Commands
@dataclass
class ComputeAttributionsCommand:
    prompt_id: int
    optimization_params: OptimParams | None

# Events  
@dataclass
class AttributionsComputed:
    prompt_id: int
    graph_data: GraphData

Alternative 2: Functional Core + Imperative Shell
Move all computation to pure functions, keep I/O at edges:

def compute_attributions(model: Model, tokens: Tensor, config: Config) -> Graph:
    # Pure function - no side effects, easy to test
    pass

class AttributionService:
    # Imperative shell - handles caching, DB, progress callbacks
    pass

🚧 Breaking Changes Needed

The current architecture is trying to be too general. Consider:

  1. Remove the three graph types - just have "computed graphs" with optional optimization metadata
  2. Eliminate the database - store graphs as JSON files, much simpler
  3. Split the mega-components - PromptAttributionsTab should be 4-5 smaller components

Bottom Line: This codebase is evolving from research prototype to production-like complexity without the discipline. Either embrace the complexity with proper abstractions, or ruthlessly simplify back to research-code simplicity.

The current middle ground is the worst of both worlds.


@danbraunai-goodfire danbraunai-goodfire changed the title App updates and backwards transformer ci functions (dev -> main) App updates and backwards transformer ci functions Feb 6, 2026
ocg-goodfire and others added 3 commits February 6, 2026 14:19
* Add batch_invariant scope to persistent PGD

Replace PersistentPGDMaskScope string literal with discriminated union
(SingleMaskScope, BroadcastAcrossBatchScope, BatchInvariantScope).
BatchInvariantScope creates (N, S, C) masks where N divides both
train and eval batch sizes, repeated along batch dim at forward time.

Removes unique_per_batch_per_token (was eval-incompatible). Includes
backwards compat for bare string scope values in configs.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
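The (N, S, C) repetition described above can be sketched in numpy; shapes and names here are illustrative, not the actual implementation:

```python
import numpy as np

# Sketch of BatchInvariantScope mechanics: an (N, S, C) mask is repeated
# along the batch dim at forward time, so the same mask works for any
# batch size B that N divides (train and eval may differ).
N, S, C = 2, 4, 3  # n_sources, seq_len, n_components (illustrative)
mask = np.random.rand(N, S, C)

def expand_mask(mask: np.ndarray, batch_size: int) -> np.ndarray:
    n = mask.shape[0]
    assert batch_size % n == 0, "N must divide the batch size"
    return np.tile(mask, (batch_size // n, 1, 1))

train = expand_mask(mask, 8)   # (8, 4, 3)
eval_ = expand_mask(mask, 6)   # (6, 4, 3)
print(train.shape, eval_.shape)
```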

* Remove commented-out _elemwise_max_component_mask

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

* Simplify PersistentPGDState: take seq_len instead of train_batch_dims

The batch size validation already happens in the Config validator,
so the constructor only needs seq_len for mask shape.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

* Remove unnecessary .contiguous() and stale comment in get_mask_infos

The in-place mask updates in step() happen after autograd.grad and
backward() have already consumed the graph, so views are fine.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

---------

Co-authored-by: Claude SPD1 <[email protected]>
Co-authored-by: Claude Opus 4.6 (1M context) <[email protected]>
When None, transitions use a linear layer (d_resid -> d_resid)
instead of an MLP (d_resid -> hidden -> d_resid with GeLU).
This allows loading older configs that don't have the field.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
danbraunai-goodfire and others added 30 commits February 19, 2026 18:07
…ge (#408)

- Update 7 YAML files: bare `lr:` → `lr_schedule:` block in PGD optimizer configs
- Update 5 YAML files: deprecated scope names (`batch_invariant` → `repeat_across_batch`,
  `unique_per_batch_per_token` → `per_batch_per_position`, `n_masks` → `n_sources`)
- Remove redundant `coeff: null` from eval_metric_configs across 16 YAML files
- Fix misleading error message in persistent_pgd.py (said "use fewer ranks" but
  fewer ranks makes the problem worse)

Co-authored-by: Claude Opus 4.6 <[email protected]>
Refactor hidden activations reconstruction into a unified module with three
variants (Stochastic, CI, PGD). Add "output" cache type to ComponentModel
for capturing post-weight activations. Remove unused pre_weight_acts param
from compute_losses.

Co-Authored-By: Claude Opus 4.6 <[email protected]>
- Remove PRAGMA journal_mode=WAL from all 3 DB classes (harvest, autointerp,
  topological_interp). WAL requires POSIX file locking which breaks on NFS.
- Scoring scripts (detection, fuzzing) now accept a separate writable InterpDB
  instead of writing through the readonly InterpRepo.
- Intruder eval opens harvest readonly + separate writable HarvestDB for scores.
- Fix try/except → try/finally in interpret.py for proper connection cleanup.
- Bump autointerp/eval/intruder jobs to 2 GPUs for memory headroom.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
Was serializing AttributionsSlurmConfig (with n_gpus, partition, etc.)
instead of just DatasetAttributionConfig to workers.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
* Add CI and PGD variants of hidden acts recon loss

Refactor hidden activations reconstruction into a unified module with three
variants (Stochastic, CI, PGD). Add "output" cache type to ComponentModel
for capturing post-weight activations. Remove unused pre_weight_acts param
from compute_losses.

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* Cleanup sum

* NOT-REVIEWED: Use existing sources for hidden act loss

* Fix typo: PGDHiddenActsReconLoss → PPGDHiddenActsReconLoss in 4L config

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* PPGDEvalLosses to encompass hidden acts and output recon

* Inline _accumulate_into_state()

* Run PPGD evals by default if exists in loss metrics

* Go back to config-based eval structure

* Use PersistentPGDReconSubsetEval and PersistentPGDReconEval

* Remove unused registry entry

---------

Co-authored-by: Claude Opus 4.6 <[email protected]>
…ts' into dev

# Conflicts:
#	tests/metrics/test_recon_losses.py
* add option to set a separate seed for dataset preparation and for the rest of the decomposition