Commit bb80485

Improve sweep defaults (#58)
Default to something that works with smaller GPUs. Use the L1 from SAE training guide.
1 parent 2b1e65c commit bb80485

1 file changed: +2 -2 lines changed


sparse_autoencoder/train/sweep_config.py

Lines changed: 2 additions & 2 deletions
@@ -75,9 +75,9 @@ class SweepParametersRuntime(dict[str, Any]):
 
     adam_weight_decay: float = 0.0
 
-    l1_coefficient: float = 0.01
+    l1_coefficient: float = 0.001
 
-    batch_size: int = 8192
+    batch_size: int = 512
 
     def to_dict(self) -> dict[str, Any]:
         """Return dict representation of this object."""
