Skip to content

Conversation

@chindris-mihai-alexandru
Copy link
Contributor

Adds docs/MULTI-AGENT-SETUP.md showing how to reduce rate limits by using Antigravity only for complex tasks and free providers (like Cerebras) for fast subagents.

Closes #257

Copilot AI review requested due to automatic review settings January 20, 2026 22:50
@coderabbitai
Copy link
Contributor

coderabbitai bot commented Jan 20, 2026

Note

Other AI code review bot(s) detected

CodeRabbit has detected other AI code review bot(s) in this pull request and will avoid duplicating their findings in the review comments. This may lead to a less comprehensive review.

Walkthrough

Adds a new documentation file at docs/MULTI-AGENT-SETUP.md describing a multi-agent architecture: a Primary Agent using Antigravity for complex reasoning, Fast subagents using Cerebras for simple/low-latency tasks, and an optional security-auditor agent. Defines model tiers (Heavy and Fast), provides example configurations for primary, subagent, and auditor roles, and discusses rationale for rate-limit management and balancing quality vs. cost. Related links are included.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~4 minutes

🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
Check name Status Explanation
Title check ✅ Passed The title 'docs: add multi-agent setup guide' directly and concisely summarizes the main change, which is adding a new documentation file about multi-agent configuration.
Description check ✅ Passed The description clearly explains the purpose of the new documentation file and references the linked issue #257, directly relating to the changeset.
Linked Issues check ✅ Passed The PR fulfills all coding objectives from issue #257: demonstrates Antigravity for primary agent, recommends free providers for subagents, provides example configurations with mode distinctions, and documents quota reduction patterns.
Out of Scope Changes check ✅ Passed The single file addition is directly aligned with issue #257's scope, containing only the requested multi-agent setup guide documentation with no unrelated changes.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.


📜 Recent review details

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Cache: Disabled due to data retention organization setting

Knowledge base: Disabled due to data retention organization setting

📥 Commits

Reviewing files that changed from the base of the PR and between 3b05339 and b2f7517.

📒 Files selected for processing (1)
  • docs/MULTI-AGENT-SETUP.md
🚧 Files skipped from review as they are similar to previous changes (1)
  • docs/MULTI-AGENT-SETUP.md
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)
  • GitHub Check: Greptile Review

✏️ Tip: You can disable this entire section by setting review_details to false in your review settings.


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@greptile-apps
Copy link
Contributor

greptile-apps bot commented Jan 20, 2026

Greptile Summary

Adds comprehensive documentation for multi-agent configuration that helps users reduce Antigravity rate limit usage by delegating simple tasks to free providers like Cerebras while reserving Antigravity for complex reasoning tasks.

  • Documents a clear architecture pattern: primary agent (Antigravity) delegates to specialized subagents (Cerebras for speed, Antigravity Sonnet for security)
  • Provides three ready-to-use configuration examples for different agent types
  • Explains the cost/performance tradeoff: most API calls go to free Cerebras (~2100 tokens/sec), complex reasoning uses Claude thinking
  • Includes helpful model tier comparison table and links to related resources

Confidence Score: 5/5

  • This PR is safe to merge with no risk - it only adds documentation
  • Documentation-only change that adds a helpful guide addressing a common user pain point (rate limits). The content is well-structured, clear, and aligns with existing documentation patterns in the repository. No code changes means zero risk of introducing bugs or breaking changes.
  • No files require special attention

Important Files Changed

Filename Overview
docs/MULTI-AGENT-SETUP.md Adds comprehensive guide for multi-agent configuration to reduce rate limits using Antigravity for complex tasks and free providers for subagents

Sequence Diagram

sequenceDiagram
    participant User
    participant Primary as Primary Agent<br/>(Antigravity Claude Opus)
    participant Explore as Explore Subagent<br/>(Cerebras llama-3.3-70b)
    participant Researcher as Researcher Subagent<br/>(Cerebras llama-3.3-70b)
    participant Coder as Coder Subagent<br/>(Cerebras llama-3.3-70b)
    participant Security as Security Auditor<br/>(Antigravity Sonnet)
    participant Antigravity as Antigravity API<br/>(Rate Limited)
    participant Cerebras as Cerebras API<br/>(Free, Fast)

    User->>Primary: Complex task request
    Note over Primary: Complex reasoning<br/>uses Antigravity quota
    Primary->>Antigravity: API call (quota used)
    Antigravity-->>Primary: Response
    
    Primary->>Explore: Delegate exploration task
    Explore->>Cerebras: API call (free, fast)
    Cerebras-->>Explore: Response (~2100 tok/sec)
    Explore-->>Primary: Exploration results
    
    Primary->>Researcher: Delegate research task
    Researcher->>Cerebras: API call (free, fast)
    Cerebras-->>Researcher: Response
    Researcher-->>Primary: Research findings
    
    Primary->>Coder: Delegate code generation
    Coder->>Cerebras: API call (free, fast)
    Cerebras-->>Coder: Response
    Coder-->>Primary: Generated code
    
    Primary->>Security: Security review (needs thinking)
    Security->>Antigravity: API call (quota used)
    Antigravity-->>Security: Security analysis
    Security-->>Primary: Security report
    
    Primary->>User: Complete response
    
    Note over Primary,Cerebras: Result: Reduced Antigravity quota usage<br/>Most API calls go to free Cerebras<br/>Complex reasoning still uses Claude thinking
Loading

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adds a new documentation guide explaining how to set up a multi-agent architecture to reduce rate limit pressure by using Antigravity only for complex tasks and free providers (like Cerebras) for faster subagent operations.

Changes:

  • Added docs/MULTI-AGENT-SETUP.md with architecture diagram, model tier recommendations, and example agent configurations
  • Documented how to balance between Antigravity's premium models and free third-party providers for optimal rate limit management

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

```markdown
<!-- ~/.config/opencode/agent/security-auditor.md -->
---
model: google/antigravity-claude-sonnet-4-5
Copy link

Copilot AI Jan 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The section header states "needs thinking" but the model configuration uses google/antigravity-claude-sonnet-4-5 which is the non-thinking variant. According to the README, antigravity-claude-sonnet-4-5-thinking is the model with extended thinking capabilities. Either update the model to use the -thinking variant, or remove the "needs thinking" comment from the header to match the actual model being used.

Suggested change
model: google/antigravity-claude-sonnet-4-5
model: google/antigravity-claude-sonnet-4-5-thinking

Copilot uses AI. Check for mistakes.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed - added -thinking suffix to the security auditor model.


| Tier | Provider | Models | Use Case |
|------|----------|--------|----------|
| Heavy | Antigravity | claude-opus-4-5-thinking, claude-sonnet-4-5-thinking | Complex reasoning, architecture |
Copy link

Copilot AI Jan 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The model names in the table lack the antigravity- prefix that is required for Antigravity models. Based on the configurations shown in the examples and the README, these should be antigravity-claude-opus-4-5-thinking and antigravity-claude-sonnet-4-5-thinking to be consistent with actual model identifiers used in the codebase.

Suggested change
| Heavy | Antigravity | claude-opus-4-5-thinking, claude-sonnet-4-5-thinking | Complex reasoning, architecture |
| Heavy | Antigravity | antigravity-claude-opus-4-5-thinking, antigravity-claude-sonnet-4-5-thinking | Complex reasoning, architecture |

Copilot uses AI. Check for mistakes.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed - added antigravity- prefix to model names in the table.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 3

🤖 Fix all issues with AI agents
In `@docs/MULTI-AGENT-SETUP.md`:
- Around line 7-13: Add a language specifier to the markdown code fence around
the architecture diagram by changing the opening triple backticks to include a
language (e.g., ```text); update the code block that starts with "Primary Agent
(Antigravity Claude Opus)" so the block is fenced as ```text (or another
appropriate language) to satisfy markdownlint and ensure correct rendering.
- Around line 15-21: Update the Security Auditor agent's model string to use the
verified "thinking" variant: replace the current model value
"google/antigravity-claude-sonnet-4-5" with
"google/antigravity-claude-sonnet-4-5-thinking" in the Security Auditor agent
configuration (the model: field for the Security Auditor entry) so it matches
the Model Tiers table and ANTIGRAVITY_API_SPEC.md.
- Around line 73-76: The "OpenCode Agent Docs" link currently points to a 404
URL; update the markdown entry that references "OpenCode Agent Docs" (the link
text with URL https://opencode.ai/docs/customization/agents) to the correct
OpenCode agents customization documentation URL by replacing the broken href
with the valid docs URL (verify the correct destination on opencode.ai or GitHub
and update the link target accordingly), ensuring the adjacent GitHub issue
reference [OpenCode `#7138`] remains unchanged.
📜 Review details

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Cache: Disabled due to data retention organization setting

Knowledge base: Disabled due to data retention organization setting

📥 Commits

Reviewing files that changed from the base of the PR and between eec6f46 and 3b05339.

📒 Files selected for processing (1)
  • docs/MULTI-AGENT-SETUP.md
🧰 Additional context used
🪛 markdownlint-cli2 (0.18.1)
docs/MULTI-AGENT-SETUP.md

7-7: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
  • GitHub Check: Upload results
  • GitHub Check: Greptile Review
🔇 Additional comments (4)
docs/MULTI-AGENT-SETUP.md (4)

1-4: LGTM! Clear and concise introduction.

The title and introduction effectively communicate the guide's purpose and align with the PR objectives.


22-34: LGTM! Clear primary agent configuration.

The example demonstrates the mode: primary configuration with Antigravity Claude Opus as specified in the PR objectives.


36-49: LGTM! Excellent subagent configuration.

The example properly demonstrates mode: subagent with Cerebras as a free provider, and appropriately restricts tools to read-only operations for exploration tasks.


66-72: Correct the Cerebras throughput specification.

Line 69 claims Cerebras provides "8000+ tokens/sec" throughput, but current specs show Cerebras Llama 3.3 (70B) actually delivers approximately 2,100–2,600 tokens/second. The "free" claim is accurate; update the speed figure to reflect actual performance.

✏️ Tip: You can disable this entire section by setting review_details to false in your review settings.

@tctinh tctinh changed the base branch from main to dev January 22, 2026 05:55
@NoeFabris
Copy link
Owner

Thanks for the contribution! This is outside the scope of this plugin—we focus on the Antigravity auth layer, not general OpenCode configuration patterns. Feel free to post this idea in the Discussions section where it might get more visibility from the community.

@NoeFabris NoeFabris closed this Jan 28, 2026
@chindris-mihai-alexandru
Copy link
Contributor Author

Thanks for the contribution! This is outside the scope of this plugin—we focus on the Antigravity auth layer, not general OpenCode configuration patterns. Feel free to post this idea in the Discussions section where it might get more visibility from the community.

ok, I see. will do!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

docs: Add multi-agent configuration guide with Antigravity + Cerebras cost optimization

2 participants