Non-Record: TTSM — Typical Ternary State-Space Model, 2.0032 bpb#1999
Open
dd-dent wants to merge 1 commit into openai:main from
First ternary SSM submission. Mamba-1 with B/C projections quantized
to {-1,0,+1}. State is protected: ternary boundary at gates, not
in hidden state. Reversed-scan Triton backward (26x speedup).
11M params, 12MB artifact, 8xH100 SXM.
Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
TTSM: Typical Ternary State-Space Model
val_bpb: 2.0032 (seed 42)
Track: non-record, 10min/16MB
Artifact: 12,039,626 bytes
Params: 11M (7.8M ternary at 1.6 bits/param, 3.3M fp16/fp32 dynamics)
Hardware: 8×H100 SXM, 154 ms/step, 3889 steps in 600s
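The 1.6 bits/param figure matches what you get by packing five ternary values into one byte (3^5 = 243 fits in 256, and 8 / 5 = 1.6). Whether this submission packs its weights that way is an assumption; the sketch below only illustrates the arithmetic, and `pack_trits` / `unpack_trits` are hypothetical names.

```python
def pack_trits(trits):
    """Pack values from {-1, 0, +1} into bytes, five per byte (base-3 digits).

    Hypothetical helper: the actual artifact format may differ.
    """
    out = bytearray()
    for i in range(0, len(trits), 5):
        group = trits[i:i + 5]
        value = 0
        # Build the byte so that the first trit is the least significant digit.
        for t in reversed(group):
            if t not in (-1, 0, 1):
                raise ValueError(f"not a ternary value: {t!r}")
            value = value * 3 + (t + 1)  # map {-1, 0, +1} to {0, 1, 2}
        out.append(value)  # at most 3**5 - 1 = 242, so it fits in one byte
    return bytes(out)


def unpack_trits(data, count):
    """Inverse of pack_trits; `count` trims the padding in the last byte."""
    trits = []
    for byte in data:
        for _ in range(5):
            trits.append(byte % 3 - 1)
            byte //= 3
    return trits[:count]


if __name__ == "__main__":
    weights = [1, 0, -1, -1, 1, 0, 0, 1]
    packed = pack_trits(weights)
    assert unpack_trits(packed, len(weights)) == weights
    print(len(packed), "bytes for", len(weights), "ternary weights")
```

Under this scheme, 7.8M ternary parameters would occupy about 7.8M × 1.6 / 8 ≈ 1.56 MB before any further compression.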
First ternary SSM submission. Mamba-1 selective SSM with the B/C projections quantized to {-1,0,+1} via a straight-through estimator (STE). The hidden state remains fp16, so it is protected from quantization error at both the write gate (B) and the readout selector (C).
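As a rough illustration of ternary quantization with an STE, here is a minimal pure-Python sketch for a single projection row. The absmean scale and the half-scale rounding threshold are assumptions borrowed from common ternary-weight recipes, not details confirmed by this PR, and all function names are hypothetical.

```python
def ternarize(weights):
    """Map latent float weights to codes in {-1, 0, +1} plus one shared scale.

    Assumption: scale is the mean absolute weight, and a weight rounds to
    +/-1 when its magnitude exceeds half the scale.
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    if scale == 0.0:
        return [0] * len(weights), 1.0
    codes = []
    for w in weights:
        if w > 0.5 * scale:
            codes.append(1)
        elif w < -0.5 * scale:
            codes.append(-1)
        else:
            codes.append(0)
    return codes, scale


def project_forward(weights, x):
    """Forward pass uses the quantized weights: y = scale * sum(code_i * x_i)."""
    codes, scale = ternarize(weights)
    return scale * sum(c * xi for c, xi in zip(codes, x))


def project_backward(x, grad_out):
    """Straight-through estimator: treat the quantizer as the identity.

    The gradient with respect to each latent weight is then grad_out * x_i,
    exactly as for an unquantized linear projection.
    """
    return [grad_out * xi for xi in x]


if __name__ == "__main__":
    latent = [0.9, -0.05, -1.1, 0.2]
    x = [1.0, 2.0, 3.0, 4.0]
    print(ternarize(latent))
    print(project_forward(latent, x))
    print(project_backward(x, grad_out=2.0))
```

Only the projection weights are quantized here; whatever consumes the projection output (in this PR, the SSM hidden state) can stay in floating point.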
Key findings
Compliance
Attributions
See records/track_non_record_16mb/2026-04-30_TTSM_TernarySSM/README.md for full writeup.