-
Notifications
You must be signed in to change notification settings - Fork 9
Open
Labels
enhancementNew feature or requestNew feature or request
Description
Problem
Thinking blocks (<thinking>...</thinking>) consume significant tokens (often 1-3K+ per response) but are never referenced again after generation.
Proposed Solution
Add deterministic removal of thinking blocks (no LLM analysis needed):
- Pattern match thinking block formats (XML tags, message part types)
- Run alongside deduplication in Phase 1
- Config:
removeThinkingBlocks: boolean(default:true)
Could save 5K-30K tokens in typical sessions with extended thinking enabled.
MartinVerges, OscSer and yamiteru
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request