
feat: OpenRouter reasoning support with multi-turn pass-back #4

Open

82deutschmark wants to merge 1 commit into v2-multi-model from openrouter-reasoning-support

Conversation

@82deutschmark
Collaborator

What

Adds proper OpenRouter reasoning support to the bench:

  1. Sends reasoning: {enabled: true} in the request payload for OpenRouter models
  2. Preserves reasoning_details in assistant messages for multi-turn conversations
  3. Adds --no-reasoning flag to disable when reasoning adds unwanted overhead
  4. Localhost llama.cpp calls are completely unaffected
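Items 1 and 2 can be sketched as follows. This is a minimal illustration, not the PR's actual code: `build_payload` and `make_assistant_message` are hypothetical helper names, and the exact shape of `reasoning_details` is assumed to be whatever OpenRouter returned on the previous turn.

```python
# Sketch of the two request-side changes (hypothetical helpers).

def build_payload(model: str, messages: list, reasoning_enabled: bool) -> dict:
    """Build a chat-completions request body."""
    payload = {"model": model, "messages": messages, "stream": True}
    if reasoning_enabled:
        # OpenRouter-specific field; llama.cpp payloads omit it entirely
        payload["reasoning"] = {"enabled": True}
    return payload

def make_assistant_message(content: str, reasoning_details=None) -> dict:
    """Keep reasoning_details on assistant messages so later turns
    pass the model's prior reasoning back to OpenRouter."""
    msg = {"role": "assistant", "content": content}
    if reasoning_details is not None:
        msg["reasoning_details"] = reasoning_details
    return msg
```

Because the `reasoning` key is only added when the flag is set, localhost llama.cpp request bodies are byte-for-byte identical to before.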

Compaction and reasoning tokens

Verified: OpenRouter includes reasoning tokens in completion_tokens and total_tokens. The compaction trigger (total_now >= token_limit) already counts reasoning tokens — no changes needed.
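A minimal sketch of why no change is needed, assuming `usage` is the usage object returned by the API (the function name is illustrative, not from the PR):

```python
def should_compact(usage: dict, token_limit: int) -> bool:
    # OpenRouter folds reasoning tokens into completion_tokens (and thus
    # total_tokens), so the existing threshold check already covers them.
    total_now = usage["total_tokens"]
    return total_now >= token_limit
```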

Previous fix included

Branch includes commit 7ce8d14 which reads both reasoning_content (llama.cpp) and reasoning (OpenRouter) from streaming deltas.
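The dual-field read from that commit can be sketched like this, assuming `delta` is the per-chunk delta dict from the streaming response (the helper name is hypothetical):

```python
def extract_reasoning(delta: dict) -> str:
    # llama.cpp streams `reasoning_content`; OpenRouter streams `reasoning`.
    # Fall back to an empty string when the chunk carries neither.
    return delta.get("reasoning_content") or delta.get("reasoning") or ""
```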

Send reasoning: {enabled: true} in the request payload for OpenRouter
models so they return reasoning tokens. Preserve reasoning_details in
assistant messages for multi-turn conversations. Add --no-reasoning
flag to disable this when reasoning adds unwanted overhead.

Localhost llama.cpp calls are unaffected (reasoning_enabled defaults
to False in LLMClient, only set True via create_client for openrouter://).

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
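The default-off behavior described above might look like the following sketch; the `LLMClient` and `create_client` signatures are assumptions based on the commit message, not the repository's actual definitions.

```python
class LLMClient:
    """Hypothetical client; reasoning is off unless explicitly enabled."""
    def __init__(self, base_url: str, reasoning_enabled: bool = False):
        self.base_url = base_url
        self.reasoning_enabled = reasoning_enabled

def create_client(endpoint: str) -> LLMClient:
    # Only openrouter:// endpoints turn reasoning on; localhost llama.cpp
    # keeps the False default, so its requests are untouched.
    if endpoint.startswith("openrouter://"):
        return LLMClient(endpoint, reasoning_enabled=True)
    return LLMClient(endpoint)
```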
