feat: stream prompt_sandbox output for faster first token by sweetmantech · Pull Request #246 · recoupable/api

sweetmantech · 2026-02-26T18:56:16Z

Summary

- Replace the blocking MCP prompt_sandbox tool with a local AI SDK generator tool that streams sandbox output in real time
- Use runCommand({ detached: true }) + cmd.logs() async generator to yield chunks as they arrive
- Reduce time to first visible token from 10-60s to ~3-8s by streaming booting → streaming → complete status updates
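
The streaming pattern described in the summary can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `LogChunk`, `FinalResult`, `fakeLogs`, and `streamCommand` are hypothetical names, and `fakeLogs` stands in for whatever `cmd.logs()` returns in the real sandbox SDK.

```typescript
// Hypothetical chunk shape; the real SDK's log entry type may differ.
type LogChunk = { stream: "stdout" | "stderr"; data: string };

// Hypothetical final value returned once the log stream ends.
type FinalResult = { stdout: string; stderr: string };

// Stand-in for cmd.logs(): an async generator that emits chunks in order.
async function* fakeLogs(): AsyncGenerator<LogChunk, void, undefined> {
  yield { stream: "stdout", data: "hello " };
  yield { stream: "stderr", data: "warn\n" };
  yield { stream: "stdout", data: "world" };
}

// Yields each chunk as soon as it arrives, and returns the accumulated
// stdout/stderr when the underlying log stream is exhausted.
async function* streamCommand(
  logs: AsyncIterable<LogChunk>,
): AsyncGenerator<LogChunk, FinalResult, undefined> {
  let stdout = "";
  let stderr = "";
  for await (const chunk of logs) {
    if (chunk.stream === "stdout") stdout += chunk.data;
    else stderr += chunk.data;
    yield chunk; // the caller sees output before the command has finished
  }
  return { stdout, stderr };
}
```

Because the generator yields per chunk instead of awaiting the whole command, a consumer can render the first output as soon as the sandbox emits it, which is presumably where the latency improvement comes from.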

Changes

File	Action
`lib/sandbox/promptSandboxStreaming.ts`	New — async generator wrapping detached sandbox command
`lib/chat/tools/createPromptSandboxStreamingTool.ts`	New — AI SDK generator tool factory
`lib/chat/setupToolsForRequest.ts`	Modified — add local streaming tool override
`lib/mcp/tools/sandbox/index.ts`	Modified — remove registerPromptSandboxTool
`lib/mcp/tools/sandbox/registerPromptSandboxTool.ts`	Deleted — replaced by local tool
`lib/sandbox/promptSandbox.ts`	Deleted — replaced by streaming version

Test plan

- 5 tests for promptSandboxStreaming (chunk ordering, stderr accumulation, detached mode, created flag, exit codes)
- 4 tests for createPromptSandboxStreamingTool (status progression, param passing, stderr handling, schema)
- 3 new tests for setupToolsForRequest (override behavior, authToken gating)
- All 152 test files pass (1229 tests)

🤖 Generated with Claude Code

Summary by CodeRabbit

Release Notes

New Features
- Real-time streaming sandbox output enables live visibility into code execution with immediate updates to stdout and stderr.
- Persistent sandbox instances allow follow-up prompts within the same environment, improving workflow continuity.
Updates
- Tool system now prioritizes local streaming implementations to enhance real-time capabilities and performance.

…t token Replace blocking MCP prompt_sandbox with a local AI SDK generator tool that streams sandbox output in real-time. Uses detached runCommand + cmd.logs() to yield chunks as they arrive, reducing first visible token from 10-60s to ~3-8s. Co-Authored-By: Claude Opus 4.6 <[email protected]>

vercel · 2026-02-26T18:56:22Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
recoup-api	Ready	Preview	Feb 26, 2026 7:14pm

coderabbitai · 2026-02-26T18:56:46Z

Warning

Rate limit exceeded

@sweetmantech has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 13 minutes and 14 seconds before requesting another review.

📥 Commits

Reviewing files that changed from the base of the PR and between 0b62c78 and a2f9bf0.

⛔ Files ignored due to path filters (2)

lib/chat/tools/__tests__/createPromptSandboxStreamingTool.test.ts is excluded by !**/*.test.*, !**/__tests__/** and included by lib/**
lib/sandbox/__tests__/promptSandboxStreaming.test.ts is excluded by !**/*.test.*, !**/__tests__/** and included by lib/**

📒 Files selected for processing (1)

lib/chat/tools/createPromptSandboxStreamingTool.ts

📝 Walkthrough

The changes migrate the prompt_sandbox tool from MCP server registration to a local streaming tool integrated in setupToolsForRequest. The synchronous promptSandbox implementation is replaced with an async generator promptSandboxStreaming that provides real-time log streaming, while local streaming tools are merged last into the tool set to override MCP-provided equivalents.
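
The "merged last to override" behavior can be illustrated with plain object spreading. This is a sketch under assumptions: `ToolLike`, `ToolSet`, `mergeTools`, and `other_tool` are hypothetical, and the real `setupToolsForRequest` may build its tool set differently.

```typescript
// Hypothetical tool shape for illustration only; real AI SDK tools carry
// schemas and execute functions rather than a `source` tag.
type ToolLike = { source: "mcp" | "local"; description: string };
type ToolSet = Record<string, ToolLike>;

// Later spreads win on key collisions, so a local tool registered under the
// same name replaces the MCP-provided one.
function mergeTools(mcpTools: ToolSet, localStreamingTools: ToolSet): ToolSet {
  return { ...mcpTools, ...localStreamingTools };
}

const mcpTools: ToolSet = {
  prompt_sandbox: { source: "mcp", description: "blocking" },
  other_tool: { source: "mcp", description: "unchanged" },
};

const localStreamingTools: ToolSet = {
  prompt_sandbox: { source: "local", description: "streaming" },
};

const merged = mergeTools(mcpTools, localStreamingTools);
```

Here `merged.prompt_sandbox` is the local streaming entry, while `merged.other_tool` still comes from the MCP set.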

Changes

Cohort / File(s)	Summary
Local Streaming Tool Creation `lib/chat/tools/createPromptSandboxStreamingTool.ts`	New module defining a local AI SDK tool with generator-based execute function that streams sandbox output in real-time, including booting state, log iteration, and final state with stdout, stderr, exitCode, sandboxId, and creation flag.
Streaming Sandbox Interface `lib/sandbox/promptSandboxStreaming.ts`	New async generator module replacing synchronous promptSandbox, providing real-time log streaming from OpenClaw sandbox execution with per-account sandbox management and accumulated output collection.
Tool Setup Integration `lib/chat/setupToolsForRequest.ts`	Integrates local streaming tools into request setup by importing createPromptSandboxStreamingTool and merging localStreamingTools last into the overall tool set to override MCP-provided tools.
MCP Registration Removal `lib/mcp/tools/sandbox/index.ts`, `lib/mcp/tools/sandbox/registerPromptSandboxTool.ts` (deleted), `lib/sandbox/promptSandbox.ts` (deleted)	Removes MCP server registration of prompt_sandbox tool and deletes the synchronous promptSandbox implementation, consolidating sandbox tooling to local streaming approach with a clarifying comment.

Sequence Diagram(s)

sequenceDiagram
    participant Client as Chat Client
    participant Setup as setupToolsForRequest
    participant Tool as createPromptSandboxStreamingTool
    participant Stream as promptSandboxStreaming
    participant Sandbox as OpenClaw Sandbox

    Client->>Setup: Initiate tool setup with authToken
    Setup->>Tool: Create streaming tool (accountId, apiKey)
    Tool->>Tool: Register generator-based execute function
    Setup->>Setup: Merge localStreamingTools last (override MCP tools)
    Client->>Tool: Execute with prompt parameter
    Tool->>Stream: Invoke async generator (accountId, apiKey, prompt)
    Stream->>Sandbox: Get or create per-account sandbox
    Stream->>Sandbox: Run command: openclaw agent --agent main --message <prompt>
    loop Stream logs in real-time
        Sandbox-->>Stream: Log chunk (stdout/stderr)
        Stream->>Stream: Accumulate output & yield log entry
        Stream-->>Tool: Streaming update
        Tool-->>Client: Yield intermediate result
    end
    Sandbox-->>Stream: Process termination (exitCode)
    Stream->>Stream: Finalize with aggregated stdout/stderr/exitCode
    Stream-->>Tool: Return complete state
    Tool-->>Client: Yield final result (summary object)
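
The flow in the diagram can be approximated by a generator-based execute function that emits a status object at each stage. This is a sketch, assuming hypothetical `Chunk`, `Final`, and `Update` shapes and a `fakeSandboxRun` stand-in for `promptSandboxStreaming`; the exact fields of the booting update in the real tool are not shown on this page.

```typescript
type Chunk = { stream: "stdout" | "stderr"; data: string };
type Final = { stdout: string; stderr: string; exitCode: number };
type Update =
  | { status: "booting"; output: string }
  | { status: "streaming"; output: string }
  | { status: "complete"; output: string; stderr: string; exitCode: number };

// Stand-in for the streaming sandbox generator (hypothetical data).
async function* fakeSandboxRun(): AsyncGenerator<Chunk, Final, undefined> {
  yield { stream: "stdout", data: "a" };
  yield { stream: "stderr", data: "note" };
  yield { stream: "stdout", data: "b" };
  return { stdout: "ab", stderr: "note", exitCode: 0 };
}

// Emits booting first, one streaming update per log chunk, then complete.
async function* execute(
  source: AsyncGenerator<Chunk, Final, undefined>,
): AsyncGenerator<Update, void, undefined> {
  yield { status: "booting", output: "" };
  let output = "";
  while (true) {
    // Manual next() calls are needed to read the generator's return value,
    // which a for-await loop would discard.
    const result = await source.next();
    if (result.done) {
      const final = result.value as Final;
      yield {
        status: "complete",
        output: final.stdout,
        stderr: final.stderr,
        exitCode: final.exitCode,
      };
      return;
    }
    const chunk = result.value as Chunk;
    if (chunk.stream === "stdout") output += chunk.data;
    yield { status: "streaming", output };
  }
}
```

A consumer iterating `execute(fakeSandboxRun())` would see booting, then three streaming updates, then a single complete update carrying the final stdout, stderr, and exit code.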

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~22 minutes

Possibly related PRs

feat: add prompt_sandbox MCP tool for persistent sandbox prompting #244: Introduces MCP-registered prompt_sandbox tool and synchronous promptSandbox implementation; this PR removes that MCP registration and synchronous function in favor of the new local streaming approach—inverse architectural transformation.
fix: replace opencode with openclaw in prompt param conversion #234: Ensures sandbox command execution uses openclaw agent --agent main --message format, which is also adopted in the new streaming sandbox implementation.

Poem

🌊 From sync to stream, the tooling takes flight,
MCP fades out as local tools glow bright.
Real-time logs now dance in the sand,
Where OpenClaw agents heed the command. ✨

🚥 Pre-merge checks | ✅ 1

✅ Passed checks (1 passed)

Check name	Status	Explanation
Solid & Clean Code	✅ Passed	Code demonstrates strong SOLID & Clean Code adherence with proper separation of concerns across layers, single responsibility per function, DRY principle application, descriptive naming, type safety via Zod schemas, and consistent error handling.

coderabbitai

🧹 Nitpick comments (2)

lib/sandbox/promptSandboxStreaming.ts (1)

35-45: Consider adding error handling for sandbox operations.

If getOrCreateSandbox or sandbox.runCommand throws, the error will propagate uncaught. Consider wrapping these operations in try-catch to provide more context-rich errors or ensure graceful degradation.

🛡️ Example error handling pattern

+  let sandbox;
+  let sandboxId: string;
+  let created: boolean;
+
+  try {
+    const result = await getOrCreateSandbox(accountId);
+    sandbox = result.sandbox;
+    sandboxId = result.sandboxId;
+    created = result.created;
+  } catch (error) {
+    throw new Error(`Failed to acquire sandbox for account ${accountId}: ${error instanceof Error ? error.message : String(error)}`);
+  }
-  const { sandbox, sandboxId, created } =
-    await getOrCreateSandbox(accountId);

   const cmd = await sandbox.runCommand({
     cmd: "openclaw",
     args: ["agent", "--agent", "main", "--message", prompt],
     env: {
       RECOUP_API_KEY: apiKey,
     },
     detached: true,
-  });
+  }).catch((error) => {
+    throw new Error(`Failed to start sandbox command: ${error instanceof Error ? error.message : String(error)}`);
+  });
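
The same idea can be factored into a small helper so both call sites share one wrapping pattern. This is a sketch only: `withContext` and `failingGetOrCreateSandbox` are hypothetical names, not part of the PR or of any sandbox SDK.

```typescript
// Runs an async operation and rethrows any failure with a contextual prefix,
// mirroring the message format used in the suggestion above.
async function withContext<T>(
  context: string,
  operation: () => Promise<T>,
): Promise<T> {
  try {
    return await operation();
  } catch (error) {
    const detail = error instanceof Error ? error.message : String(error);
    throw new Error(`${context}: ${detail}`);
  }
}

// Hypothetical stand-in for getOrCreateSandbox that always fails.
async function failingGetOrCreateSandbox(accountId: string): Promise<never> {
  throw new Error(`quota exceeded for ${accountId}`);
}
```

A call such as `withContext("Failed to acquire sandbox for account acct_1", () => failingGetOrCreateSandbox("acct_1"))` rejects with the message `Failed to acquire sandbox for account acct_1: quota exceeded for acct_1`, while successful operations pass their value through unchanged.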

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@lib/sandbox/promptSandboxStreaming.ts` around lines 35 - 45, Wrap the calls
to getOrCreateSandbox(accountId) and sandbox.runCommand({...}) in a try-catch
inside promptSandboxStreaming.ts so sandbox creation/command failures are caught
and annotated with context (accountId, sandboxId when available) before
rethrowing or returning a controlled error; specifically, catch errors around
getOrCreateSandbox and the call that produces cmd, log or attach the original
error message and relevant identifiers (sandboxId, created flag, command args
like "openclaw" and prompt) and ensure any created resources are cleaned up or
rolled back if needed before propagating the enriched error.

lib/chat/tools/createPromptSandboxStreamingTool.ts (1)

40-63: Redundant stdout accumulation - already tracked in promptSandboxStreaming.

The stdout variable is accumulated here (lines 58-60) even though promptSandboxStreaming already maintains accumulated stdout and stderr in its return value (lines 51-52 overwrite with the final values anyway). This duplication violates DRY and wastes memory for long-running processes.

You can simplify by only using the yielded chunks for streaming updates and relying on the final return value for the complete output.

♻️ Simplified approach using only streamed chunks for display

       let stdout = "";
       let stderr = "";
       let exitCode = 0;
       let sandboxId = "";
       let created = false;
+      let streamedOutput = "";

       while (true) {
         const { value, done } = await gen.next();

         if (done) {
           sandboxId = value.sandboxId;
           stdout = value.stdout;
           stderr = value.stderr;
           exitCode = value.exitCode;
           created = value.created;
           break;
         }

         if (value.stream === "stdout") {
-          stdout += value.data;
+          streamedOutput += value.data;
         }

-        yield { status: "streaming" as const, output: stdout };
+        yield { status: "streaming" as const, output: streamedOutput };
       }

       yield {
         status: "complete" as const,
         output: stdout,
         stderr,
         exitCode,
       };

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@lib/chat/tools/createPromptSandboxStreamingTool.ts` around lines 40 - 63,
Remove the redundant local accumulation of stdout/stderr inside the streaming
loop: stop maintaining the top-level stdout/stderr strings (and their
initializations) and do not append value.data into stdout on each chunk.
Instead, use the yielded chunks for streaming updates (e.g., yield { status:
"streaming", output: <currentChunkOrClient-side-accumulation> } using
value.data) and when the generator finishes, read the final complete outputs
(value.stdout / value.stderr) and other metadata (value.sandboxId,
value.exitCode, value.created) from the final returned value. Update references
in this function (the gen loop and the final-done branch) so the final complete
outputs come only from the generator's return values and remove the local
stdout/stderr accumulation logic.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d095105 and 0b62c78.

⛔ Files ignored due to path filters (5)

lib/chat/__tests__/setupToolsForRequest.test.ts is excluded by !**/*.test.*, !**/__tests__/** and included by lib/**
lib/chat/tools/__tests__/createPromptSandboxStreamingTool.test.ts is excluded by !**/*.test.*, !**/__tests__/** and included by lib/**
lib/mcp/tools/sandbox/__tests__/registerPromptSandboxTool.test.ts is excluded by !**/*.test.*, !**/__tests__/** and included by lib/**
lib/sandbox/__tests__/promptSandbox.test.ts is excluded by !**/*.test.*, !**/__tests__/** and included by lib/**
lib/sandbox/__tests__/promptSandboxStreaming.test.ts is excluded by !**/*.test.*, !**/__tests__/** and included by lib/**

📒 Files selected for processing (6)

lib/chat/setupToolsForRequest.ts
lib/chat/tools/createPromptSandboxStreamingTool.ts
lib/mcp/tools/sandbox/index.ts
lib/mcp/tools/sandbox/registerPromptSandboxTool.ts
lib/sandbox/promptSandbox.ts
lib/sandbox/promptSandboxStreaming.ts

💤 Files with no reviewable changes (2)

lib/mcp/tools/sandbox/registerPromptSandboxTool.ts
lib/sandbox/promptSandbox.ts

- Use explicit type assertions for IteratorResult narrowing (TS can't narrow IteratorYieldResult due to `done?: false`)
- Replace `tool()` helper with plain Tool object (helper can't infer generator generics in [email protected])
- Use `inputSchema` field directly instead of `parameters`

Co-Authored-By: Claude Opus 4.6 <[email protected]>
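
The first point in this commit message can be illustrated with a small synchronous generator. This is a sketch with hypothetical names (`countTo`, `drain`); whether TypeScript narrows `value` after checking `done` depends on the compiler version and on how the result is destructured, so the assertions below are written to be harmless either way.

```typescript
// A generator that yields numbers and returns a summary string.
function* countTo(n: number): Generator<number, string, undefined> {
  for (let i = 1; i <= n; i++) yield i;
  return `counted to ${n}`;
}

// Drains the generator, collecting yielded values and the return value.
function drain(
  gen: Generator<number, string, undefined>,
): { yielded: number[]; summary: string } {
  const yielded: number[] = [];
  while (true) {
    const { value, done } = gen.next();
    if (done) {
      // `value` may still be typed as number | string here, so the return
      // type is asserted explicitly.
      return { yielded, summary: value as string };
    }
    // Likewise, assert the yield type on the not-done branch.
    yielded.push(value as number);
  }
}
```

Calling `drain(countTo(3))` collects the yielded values 1, 2, 3 and the summary string "counted to 3".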

sweetmantech mentioned this pull request Feb 26, 2026

feat: add prompt_sandbox streaming progress UI recoupable/chat#1548

Merged

3 tasks

vercel bot had a problem deploying to Preview February 26, 2026 18:56 Failure

coderabbitai bot reviewed Feb 26, 2026

View reviewed changes

vercel bot deployed to Preview February 26, 2026 19:14 View deployment

sweetmantech merged commit c854a1f into test Feb 26, 2026
3 checks passed

sweetmantech merged 2 commits into test from sweetmantech/myc-4351-api-prompt_sandbox-tool-stream-logs-for-faster-first-token
