Compose split routed experts from vLLM responses #1349

Open
S1ro1 wants to merge 7 commits into main from feat/split-routed-experts

Conversation

@S1ro1 S1ro1 commented May 11, 2026

Summary

  • add a compact RoutedExperts token payload type backed by int16 bytes (see the sketch after this list)
  • compose vLLM's split prompt_routed_experts and completion routed_experts into one sequence-aligned payload
  • decode only the compact base64 object emitted by patched vLLM; the old base85/list routed-experts path is not supported
  • update the chat, completions, and renderer clients to consume the new split routed-experts response shape
  • preserve routed experts when response tokens are truncated
  • merge current main so the PR also carries the renderer multimodal sidecar changes without conflicts
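
The payload type itself can be small. A minimal sketch of the idea, assuming int16 expert ids, a plain base64 wire encoding, and a fixed number of ids per token; the class name matches the PR, but the field and method names here are hypothetical:

```python
import base64
from dataclasses import dataclass

import numpy as np


@dataclass
class RoutedExperts:
    """Compact routed-experts payload: expert ids stored as raw int16 bytes."""

    data: bytes             # flattened int16 expert ids
    experts_per_token: int  # hypothetical: e.g. one id per MoE layer

    @classmethod
    def from_base64(cls, payload: str, experts_per_token: int) -> "RoutedExperts":
        # Patched vLLM is assumed to emit a plain base64 string of int16 bytes;
        # the older base85/list encoding is deliberately not handled.
        return cls(data=base64.b64decode(payload), experts_per_token=experts_per_token)

    def to_array(self) -> np.ndarray:
        # View the raw bytes as (num_tokens, experts_per_token) without copying.
        return np.frombuffer(self.data, dtype=np.int16).reshape(
            -1, self.experts_per_token
        )
```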

Validation

  • uv run pytest tests/test_renderer_client.py tests/test_env_server.py -q
  • uvx ruff@0.15.12 format --isolated --check .
  • uvx ruff@0.15.12 check --isolated .

Note

Medium Risk
Touches token parsing and serialization paths across multiple clients and changes the routed_experts wire format and type, so malformed or partial payloads and shape mismatches could break downstream consumers and truncation behavior.

Overview
Adds a new RoutedExperts bytes-based type plus verifiers/clients/routed_experts.py utilities to decode base64 int16 routed-expert payloads and compose split prompt+completion routing into a single sequence-aligned buffer.

Updates the OpenAI chat, OpenAI completions, and renderer clients to read routed-expert data from model_extra/response fields (prompt_routed_experts plus completion routed_experts), removes the previous inline base85/NumPy decode path, and threads the composed routing into ResponseTokens.
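
Composition is then a matter of decoding both halves and concatenating them in sequence order. A rough sketch, reusing the hypothetical RoutedExperts type above and assuming both fields arrive as base64 strings in the response's model_extra:

```python
import base64


def compose_routed_experts(
    model_extra: dict, experts_per_token: int
) -> RoutedExperts | None:
    # Field names follow the PR description; anything missing means the
    # server was not patched, so return None rather than guess.
    prompt_b64 = model_extra.get("prompt_routed_experts")
    completion_b64 = model_extra.get("routed_experts")
    if prompt_b64 is None or completion_b64 is None:
        return None
    # Prompt tokens precede completion tokens, so byte concatenation keeps
    # the buffer sequence-aligned with prompt_ids + completion_ids.
    data = base64.b64decode(prompt_b64) + base64.b64decode(completion_b64)
    return RoutedExperts(data=data, experts_per_token=experts_per_token)
```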

Adjusts response token truncation to slice the new bytes-based routed-experts buffer correctly so routing metadata remains consistent when prompts/completions are clipped.
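
Because the buffer is raw int16 bytes rather than a per-token list, truncation has to slice in byte units: each token owns experts_per_token ids at 2 bytes apiece. A sketch of the arithmetic under the same assumptions as above:

```python
def truncate_routed_experts(payload: RoutedExperts, max_tokens: int) -> RoutedExperts:
    # 2 bytes per int16 id, experts_per_token ids per token.
    bytes_per_token = payload.experts_per_token * 2
    return RoutedExperts(
        data=payload.data[: max_tokens * bytes_per_token],
        experts_per_token=payload.experts_per_token,
    )
```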

Reviewed by Cursor Bugbot for commit 162cffb.

@S1ro1 S1ro1 force-pushed the feat/split-routed-experts branch from 277ab6e to 8dbd674 on May 11, 2026 at 22:28
@S1ro1 S1ro1 marked this pull request as ready for review May 11, 2026 22:42

@cursor cursor Bot left a comment

Cursor Bugbot has reviewed your changes and found 3 potential issues.

Reviewed by Cursor Bugbot for commit 8dbd674.

Comment thread verifiers/utils/response_utils.py Outdated
Comment thread verifiers/clients/routed_experts.py
Comment thread verifiers/clients/openai_chat_completions_client.py Outdated

willccbb commented May 12, 2026

Can we put this in a utils file? Trying to keep most folders unified by object type, e.g. clients is full of _client.py files at top level. Could be verifiers.utils.router_utils/client_utils or verifiers.clients.utils.router_utils.

S1ro1 commented May 12, 2026

can we put this in a utils file?

Yeah, can clean up after. For now I'm not 100% sure we can merge; it hits inference speed quite a lot and adds a lot of engineering overhead on prime-rl.
