Reduce tool denial failures in three high-frequency agentic workflows by Copilot · Pull Request #42273 · github/gh-aw

Copilot · 2026-06-29T15:28:01Z

Three daily/weekly workflows (daily-safe-output-integrator, daily-formal-spec-verifier, layout-spec-maintainer) were routinely hitting the 5/5 tool denial limit before emitting any safe output, causing complete run loss.

Changes

Scope reduction (batch size limits)

Safe Output Integrator: cap at 20 missing types per run; note remainder for the next run
Layout Spec Maintainer: cap at 50 lock files total across all category queries per run

Early emission

Layout Spec Maintainer: emit create-pull-request immediately after the scan — do not defer to gather more data

Tool budget awareness (all three workflows)

Added an explicit instruction to each workflow's constraints/guardrails:

If you are approaching the tool call limit, emit a partial result immediately rather than continuing to gather more data.

For Formal Spec Verifier, the instruction includes a priority ordering to resolve ambiguity between two fallback modes: emit partial create_issue if formalization is substantially complete (≥5 predicates, ≥3 test functions), otherwise emit report_incomplete immediately.

Tests

Added one test per workflow asserting the budget-awareness and batch-limit guidance is present in the source:

TestDailySafeOutputIntegratorHasToolBudgetAwareness
TestDailyFormalSpecVerifierHasToolBudgetAwareness
TestLayoutSpecMaintainerHasToolBudgetAwareness

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds explicit tool-budget guidance to multiple GitHub Actions workflow prompt files and introduces Go tests that assert these instructions are present, helping ensure agents emit partial output instead of failing silently when tool-call limits are reached.

Changes:

Add “tool budget awareness” and batch-size/early-emission instructions to workflow prompt markdown files.
Add Go tests that read workflow prompt files and assert required budget-related guidance is included.
Update workflow lock metadata hashes to reflect prompt content changes.

Show a summary per file

File	Description
pkg/workflow/prompts_test.go	Adds assertions that specific workflow prompts include tool-budget guidance.
pkg/workflow/daily_safe_output_integrator_workflow_test.go	Adds a test to verify the safe-output integrator prompt includes budget/batch guidance.
.github/workflows/layout-spec-maintainer.md	Documents new lockfile scan batch limit, early emission, and tool-budget awareness.
.github/workflows/layout-spec-maintainer.lock.yml	Updates generated lock metadata hashes after prompt changes.
.github/workflows/daily-safe-output-integrator.md	Documents batch-size and tool-budget awareness for integrator runs.
.github/workflows/daily-safe-output-integrator.lock.yml	Updates generated lock metadata hashes after prompt changes.
.github/workflows/daily-formal-spec-verifier.md	Adds tool-budget awareness instruction with a fallback order.
.github/workflows/daily-formal-spec-verifier.lock.yml	Updates generated lock metadata hashes after prompt changes.

Review details

Tip

Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Files reviewed: 8/8 changed files
Comments generated: 4
Review effort level: Low

 9. **Sort alphabetically**: Keep lists organized and easy to scan
 10. **Real SHA values**: When listing actions, use actual commit SHAs found
+11. **Batch size**: When scanning lock files for patterns, run at most **50 lock files total** across all category queries (actions, artifacts, job names, paths) combined per run. If there are more than 50 lock files, sample the first 50 and note the remainder in the PR body.
+12. **Early emission**: After completing the scan and generating the document, emit the `create-pull-request` safe output immediately. Do not delay to gather additional data beyond the batch limit.


 )

+func TestDailySafeOutputIntegratorHasToolBudgetAwareness(t *testing.T) {
+	sourceContent, err := os.ReadFile("../../.github/workflows/daily-safe-output-integrator.md")


+	if !strings.Contains(workflow, "If you are approaching the tool call limit, emit a partial result immediately rather than continuing to gather more data.") {
+		t.Fatal("Expected daily-formal-spec-verifier workflow to contain tool budget awareness instruction")
+	}


+func TestDailyFormalSpecVerifierHasToolBudgetAwareness(t *testing.T) {
+	repoRoot, err := findRepoRoot()
+	if err != nil {
+		t.Fatalf("Failed to find repo root: %v", err)
+	}
+
+	workflowFile := filepath.Join(repoRoot, ".github", "workflows", "daily-formal-spec-verifier.md")
+	content, err := os.ReadFile(workflowFile)
+	if err != nil {
+		t.Fatalf("Failed to read workflow file: %v", err)
+	}


github-actions · 2026-06-29T17:08:42Z

🎉 This pull request is included in a new release.

Release: v0.82.0

Initial plan

12db83d

Copilot AI assigned Copilot and pelikhan Jun 29, 2026

Copilot started work on behalf of pelikhan June 29, 2026 15:28 View session

Copilot AI linked an issue Jun 29, 2026 that may be closed by this pull request

[systemic] Multiple agents hitting tool denial limit — structural complexity reduction needed #42258

Closed

Fix systemic tool denial limit failures in 3 agentic workflows

3288f27

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Reduce structural complexity to resolve tool denial limit~~ Reduce tool denial failures in three high-frequency agentic workflows Jun 29, 2026

Copilot finished work on behalf of pelikhan June 29, 2026 15:45

Copilot AI requested a review from pelikhan June 29, 2026 15:45

pelikhan marked this pull request as ready for review June 29, 2026 16:17

Copilot AI review requested due to automatic review settings June 29, 2026 16:17

pelikhan merged commit 7e2ace0 into main Jun 29, 2026

pelikhan deleted the copilot/gh-aw-42125-reduce-structural-complexity branch June 29, 2026 16:17

Copilot stopped reviewing on behalf of pelikhan due to an error June 29, 2026 16:17
Failed to launch agent

Copilot AI reviewed Jun 29, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Reduce tool denial failures in three high-frequency agentic workflows#42273

Reduce tool denial failures in three high-frequency agentic workflows#42273
pelikhan merged 2 commits into
mainfrom
copilot/gh-aw-42125-reduce-structural-complexity

Copilot AI commented Jun 29, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

github-actions Bot commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

Copilot AI commented Jun 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Scope reduction (batch size limits)

Early emission

Tool budget awareness (all three workflows)

Tests

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Review details

Uh oh!

github-actions Bot commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Copilot AI commented Jun 29, 2026 •

edited

Loading