Harden issue-intent confidence handling: clearer INTENT suffix + lenient validation by Copilot · Pull Request #43344 · github/gh-aw

Copilot · 2026-07-04T08:05:59Z

The agent intermittently sends confidence as a numeric value (0.9, "0.95") instead of the required enum string. The validator was rejecting the entire message on this optional field, silently dropping valid tool calls.

Changes

Fix 1 — Clearer INTENT description suffix (`generate_safe_outputs_tools.cjs`)

Explicitly states both type and allowed values to reduce model confusion:

Before: confidence (LOW/MEDIUM/HIGH)
After:  confidence (string, exactly one of: LOW, MEDIUM, HIGH)

Fix 2 — `x-strip-on-error` lenient validation

Introduces x-strip-on-error to FieldValidation (following the existing x- extension convention like x-synonyms). When set, a failing optional enrichment field is stripped from the item rather than rejecting the whole message.

StripOnError bool \json:"x-strip-on-error,omitempty"`added to GoFieldValidation` struct
confidence and rationale in set_issue_type / set_issue_field marked with x-strip-on-error: true
validateItem in safe_output_type_validator.cjs deletes the field instead of accumulating an error when the flag is set
validateIssueIntentLabels made lenient for label-level confidence (non-enum values stripped) and rationale (non-string values stripped)

Net effect: a set_issue_type or set_issue_field call with "confidence": 0.9 now passes validation with the bad field dropped, rather than being silently lost.

…rip-on-error lenient validation - Fix 1: Strengthen INTENT description suffix in generate_safe_outputs_tools.cjs from 'confidence (LOW/MEDIUM/HIGH)' to 'confidence (string, exactly one of: LOW, MEDIUM, HIGH)' - Fix 2: Add StripOnError/x-strip-on-error to FieldValidation struct and mark confidence and rationale in set_issue_type and set_issue_field as lenient fields. When these optional enrichment fields fail validation, they are stripped from the item instead of rejecting the whole message. - Also make validateIssueIntentLabels lenient: invalid label confidence and non-string rationale are stripped rather than rejecting the whole item. - Update tests: mock config includes x-strip-on-error, new tests cover lenient behavior for numeric confidence values (0.9, '0.95') and non-string rationale. Closes #43164 Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

github-actions · 2026-07-04T08:28:24Z

🧠 Matt Pocock Skills Reviewer has completed the skills-based review. ✅

github-actions · 2026-07-04T08:28:28Z

✅ Test Quality Sentinel completed test quality analysis.

github-actions · 2026-07-04T08:28:29Z

✅ Design Decision Gate 🏗️ completed the design decision gate check.

No ADR enforcement needed: PR #43344 does not have the 'implementation' label and has only 16 new lines of code in business logic directories (threshold: 100).

github-actions · 2026-07-04T08:28:34Z

✅ PR Code Quality Reviewer completed the code quality review.

Copilot

Pull request overview

This PR hardens the “issue intent” safe-output validation pipeline so optional enrichment fields (notably confidence and rationale) don’t cause otherwise-valid tool calls to be rejected when the agent sends malformed values. It also updates the tool description suffix to reduce model confusion about the expected types/values.

Changes:

Clarifies the INTENT description suffix to explicitly state confidence is a string enum and rationale is a string with max length.
Introduces x-strip-on-error / StripOnError to allow invalid optional enrichment fields to be dropped instead of rejecting the entire item.
Makes add_labels label-level confidence/rationale handling lenient, and adds JS/Go test coverage for stripping behavior.

Show a summary per file

File	Description
pkg/workflow/safe_outputs_validation_config.go	Adds `StripOnError` to `FieldValidation` and enables it for intent enrichment fields.
pkg/workflow/safe_output_validation_config_test.go	Verifies `StripOnError` is enabled for `confidence`/`rationale` in issue-intent configs.
actions/setup/js/safe_output_type_validator.cjs	Implements strip-on-error behavior in `validateItem` and makes label intent validation lenient.
actions/setup/js/safe_output_type_validator.test.cjs	Adds tests ensuring invalid `confidence`/`rationale` are stripped (not rejected).
actions/setup/js/generate_safe_outputs_tools.cjs	Updates INTENT suffix text to be more explicit about type/allowed values.
actions/setup/js/generate_safe_outputs_tools.test.cjs	Updates expected INTENT suffix string in tests.

Review details

Tip

Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Files reviewed: 6/6 changed files
Comments generated: 1
Review effort level: Low

github-actions · 2026-07-04T08:36:14Z

🧪 Test Quality Sentinel Report

✅ Test Quality Score: 81/100 — Excellent

Analyzed 7 test(s): 7 design, 0 implementation, 0 violation(s).

📊 Metrics (7 tests)

Metric	Value
Analyzed	7 (Go: 1, JS: 6)
✅ Design	7 (100%)
⚠️ Implementation	0 (0%)
Edge/error coverage	5 (71%)
Duplicate clusters	0
Inflation	YES — safe_output_type_validator.test.cjs: 75 added vs 21 prod lines (3.6:1; justified, see note)
🚨 Violations	0

Test	File	Classification	Issues
`generate_safe_outputs_tools` intent suffix with feature flag	`generate_safe_outputs_tools.test.cjs`	design_test / behavioral_contract	None
`generate_safe_outputs_tools` intent suffix without feature flag	`generate_safe_outputs_tools.test.cjs`	design_test / behavioral_contract	None
`should strip invalid label confidence instead of rejecting`	`safe_output_type_validator.test.cjs`	design_test / behavioral_contract	None
`should strip non-string label rationale instead of rejecting`	`safe_output_type_validator.test.cjs`	design_test / behavioral_contract	None
`should strip invalid confidence (x-strip-on-error)`	`safe_output_type_validator.test.cjs`	design_test / behavioral_contract	None
`should strip non-string rationale (x-strip-on-error)`	`safe_output_type_validator.test.cjs`	design_test / behavioral_contract	None
`TestIssueIntentValidationFields` StripOnError assertions	`safe_output_validation_config_test.go`	design_test / behavioral_contract	None

i️ Inflation Note

safe_output_type_validator.test.cjs — 75 lines added to test vs 21 lines added to production (3.6:1, above the 2:1 threshold). This is justified: four new it() blocks each require an async dynamic import plus multiple expect assertions to fully cover the two new x-strip-on-error behaviors (confidence and rationale) across two tool types (set_issue_type/set_issue_field and add_labels with nested items). Edge cases include numeric strings, raw numbers, and object values. No concern.

Verdict

✅ Passed. 0% implementation tests (threshold: 30%). No guideline violations. Go test file carries the required //go:build !integration build tag. The four new JS tests directly exercise the x-strip-on-error contract for every affected tool and field combination; edge cases cover numeric strings, raw numbers, and object values as invalid inputs. The two updated test strings in generate_safe_outputs_tools.test.cjs correctly mirror the clarified INTENT suffix wording.

Warning

Firewall blocked 1 domain

The following domain was blocked by the firewall during workflow execution:

awmgmcpg

To allow these domains, add them to the network.allowed list in your workflow frontmatter:

network:
  allowed:
    - defaults
    - "awmgmcpg"

See Network Configuration for more information.

🧪 Test quality analysis by Test Quality Sentinel · 35.2 AIC · ⌖ 13.4 AIC · ⊞ 6.8K · ◷
_{Comment /review to run again}

github-actions

✅ Test Quality Sentinel: 81/100 — Excellent. 0% implementation tests (threshold: 30%). No guideline violations.

github-actions

Skills-Based Review 🧠

Applied /diagnosing-bugs, /tdd, and /codebase-design — commenting with non-blocking suggestions.

📋 Key Themes & Highlights

Key Themes

Dual code paths: The x-strip-on-error logic is correctly implemented for item-level fields via the flag in validateItem, but the equivalent label-level stripping in validateIssueIntentLabels is a parallel, hand-rolled implementation that doesn't reference the flag. This duplication is a future maintenance risk.
Test granularity: Some new tests bundle two independent failure modes in a single it block, making failure output less diagnostic.
Cross-language naming bridge: The Go StripOnError / JS x-strip-on-error mapping is undocumented in the JSDoc typedef, which will puzzle future maintainers.

Positive Highlights

✅ The bug description is clear and the root cause (agent sends numeric confidence, validator rejects the whole message) is properly addressed — not just the symptom.
✅ The x-strip-on-error extension follows the existing x- naming convention — nice consistency.
✅ Strong test coverage added (+75 lines) covering both set_issue_type and set_issue_field for both numeric-string and raw-number confidence values.
✅ The INTENT description suffix improvement is precise and directly actionable for the model.
✅ Go and JS validation configs stay in sync — StripOnError added to Go struct and the test validates it's set.

🧠 Reviewed using Matt Pocock's skills by Matt Pocock Skills Reviewer · 63.8 AIC · ⌖ 8.02 AIC · ⊞ 6.6K
_{Comment /matt to run again}

Comments that could not be inline-anchored

actions/setup/js/safe_output_type_validator.cjs:723

[/diagnosing-bugs] The x-strip-on-error flag is accessed via bracket notation on the validation object (validation["x-strip-on-error"]), but FieldValidation JSDoc defines the property as x-strip-on-error (with hyphens). In strict TypeScript / closure-annotated JS, this might not align with the typed property. More importantly, there is no test asserting that normalizedItem retains the valid fields when stripping — only that confidence is undefined. Consider adding an asserti…

actions/setup/js/safe_output_type_validator.cjs:159

[/diagnosing-bugs] The label-level rationale/confidence stripping is handled directly in validateIssueIntentLabels by silently ignoring invalid values, but the same x-strip-on-error extension used for item-level fields is not applied here. This creates two separate code paths for the same concept. A regression in either path may be harder to spot because the logic is duplicated rather than unified.

<details>
<summary>💡 Why this matters</summary>

If someone later changes the l…

actions/setup/js/safe_output_type_validator.test.cjs:1025

[/tdd] The two new x-strip-on-error tests batch multiple distinct input scenarios ("0.95" numeric string and raw 0.9) into a single it block. Per the red-green-refactor discipline, each distinct failure mode deserves its own test so the failure output is immediately diagnostic.

<details>
<summary>💡 Suggested split</summary>

it(&#39;strips confidence when sent as a numeric string (&quot;0.95&quot;)&#39;, async () =&gt; {
  const result = validateItem({ type: &#39;set_issue_type&#39;, issue_type: &#39;Bug&#39;, …

</details>

<details><summary>actions/setup/js/safe_output_type_validator.cjs:217</summary>

**[/codebase-design]** The JSDoc for `x-strip-on-error` is placed in the `@typedef` comment block for `FieldValidationConfig` but the runtime checks the flag via `validation[&quot;x-strip-on-error&quot;]` at line 723. Because the hyphenated name cannot be a normal JS property key in the typed interface, two naming conventions now exist simultaneously — the JSON key `x-strip-on-error` and its Go mirror `StripOnError`. The JSDoc comment uses the JSON form, but downstream readers need to know the mapping.

…

</details>

<details><summary>actions/setup/js/safe_output_type_validator.test.cjs:395</summary>

**[/tdd]** The new label-strip tests cover `add_labels` but not `set_issue_type` or `set_issue_field` label entries. If those tools also accept a `labels` array with `confidence`/`rationale`, edge cases there are untested. If they do not, a comment clarifying that `validateIssueIntentLabels` is only called from `add_labels` would prevent future confusion.

&lt;details&gt;
&lt;summary&gt;💡 Clarification or additional test&lt;/summary&gt;

If `validateIssueIntentLabels` is used only by `add_labels`, document that…

</details>

github-actions

Review: Harden issue-intent confidence handling

Overall: The fix is correct, well-scoped, and safe. The root cause (model occasionally sending numeric confidence) is real, and the x-strip-on-error mechanism is a clean, extensible pattern to handle optional enrichment fields leniently.

What's good:

The Go ↔ JS config round-trip is consistent: StripOnError bool serialises to "x-strip-on-error" and the JS validator reads it with bracket access.
New tests cover all four cases (numeric string "0.95", raw number 0.9, non-string rationale, non-string confidence) across both top-level fields and label-level fields.
The description suffix change is minimal and unambiguous.
validateIssueIntentLabels is hardened inline (strip instead of reject) to match the new flag semantics.

Minor suggestions (non-blocking):
Two inline comments filed — both are style/readability improvements with no correctness impact.

🧵 Reviewed using Impeccable skills by Impeccable Skills Reviewer · 62.9 AIC · ⌖ 6.39 AIC · ⊞ 4.9K

github-actions · 2026-07-04T08:38:56Z

-      const rationale = normalizeIssueIntentRationale(label.rationale, options);
-      if (rationale) {
-        normalizedLabel.rationale = rationale;
+        // Strip non-string rationale instead of rejecting (optional enrichment field)


The empty-body if branch for non-string rationale is intentional but easy to misread as a coding error. Consider inverting the condition to make the "strip and continue" intent explicit:

if (typeof label.rationale === 'string') { const rationale = normalizeIssueIntentRationale(label.rationale, options); if (rationale) normalizedLabel.rationale = rationale; } // non-string rationale silently stripped (optional enrichment field)

This avoids the empty if body and makes the stripping behaviour self-evident.

@copilot please address this.

github-actions · 2026-07-04T08:38:56Z

-      errors.push(result.error);
+      // When x-strip-on-error is set, strip the invalid optional field instead of rejecting the item.
+      // This is used for enrichment-only fields like confidence and rationale.
+      if (validation["x-strip-on-error"]) {


The x-strip-on-error flag is accessed with bracket notation (validation["x-strip-on-error"]) here, but the JSDoc typedef (line 217) documents it as a property named x-strip-on-error. Bracket access is the only valid approach given the hyphen in the name — that's fine — but the typedef should explicitly note that the hyphen makes dot-access impossible, to avoid confusion:

/** * `@property` {boolean} ['x-strip-on-error'] - ... * (Bracket access only due to hyphen: validation['x-strip-on-error']) */

Also consider whether a camelCase alias (e.g. stripOnError) in the JS config would be cleaner than a hyphenated key — the Go struct already uses StripOnError with json:"x-strip-on-error" for serialisation.

@copilot please address this.

github-actions

Two medium-priority maintainability concerns; no blocking correctness bugs found

The stripping-instead-of-rejecting approach is sound for the immediate problem (numeric confidence values silently dropping valid tool calls). The Go struct addition is clean and the test coverage is adequate for the new paths.

Two issues worth addressing before the pattern proliferates:

x-strip-on-error has no guard against co-occurring with required (line 716) — currently safe, but the code will silently pass structurally invalid items if someone combines both flags in future config. A one-line guard or an assertion in tests would close the gap permanently.
Two independent stripping paths — validateIssueIntentLabels hardcodes stripping for label sub-fields while validateItem uses the schema flag. They work correctly today but will require parallel maintenance for any future behavioural change (e.g. adding a warning-on-strip log). This should be documented or unified.

🔎 Code quality review by PR Code Quality Reviewer · 110.3 AIC · ⌖ 6.53 AIC · ⊞ 5.4K
_{Comment /review to run again}

github-actions · 2026-07-04T08:40:22Z

-      errors.push(result.error);
+      // When x-strip-on-error is set, strip the invalid optional field instead of rejecting the item.
+      // This is used for enrichment-only fields like confidence and rationale.
+      if (validation["x-strip-on-error"]) {


x-strip-on-error silently suppresses required-field failures with no guard against misconfiguration. If a future field is ever marked both required: true and x-strip-on-error: true, this branch swallows the required-field error and passes a structurally invalid item — with no runtime warning.

💡 Details and suggested fix

validateField returns isValid: false for required-field missing errors. The new x-strip-on-error branch catches that return path identically to a type-mismatch error. Nothing prevents a config author from combining both flags in the future.

Add an explicit guard at the call site:

if (validation["x-strip-on-error"]) { if (validation.required) { // required-field failures are hard errors regardless of x-strip-on-error errors.push(result.error); } else { delete normalizedItem[fieldName]; } } else { errors.push(result.error); }

Alternatively, add a config-time assertion (e.g., in tests) that no field is simultaneously required and x-strip-on-error. The current config is safe, but the code provides no structural defence against future drift.

github-actions · 2026-07-04T08:40:22Z

-      const rationale = normalizeIssueIntentRationale(label.rationale, options);
-      if (rationale) {
-        normalizedLabel.rationale = rationale;
+        // Strip non-string rationale instead of rejecting (optional enrichment field)


Two parallel stripping paths for label sub-fields (validateIssueIntentLabels) and item-level fields (validateItem) are now structurally independent and will diverge. The hardcoded stripping here is not driven by the x-strip-on-error schema flag, so the flag set on labels items' confidence/rationale entries in the config has no effect on this code path.

💡 Details

validateIssueIntentLabels strips invalid rationale (line 150) and confidence (line 174) unconditionally via hardcoded logic. The generic validateItem path (line 716) strips based on the x-strip-on-error flag in the field schema.

These two paths are already inconsistent: any future change to the stripping behaviour — adding a log/warning on strip, or making stripping conditional on an option — will need to be applied in two places. The test at line 119 of the test file marks the labels array with x-strip-on-error: true, but validateIssueIntentLabels is the actual runtime code path for label validation and never checks that flag.

If the intent is a single authoritative strip-on-error mechanism, the hardcoded stripping in validateIssueIntentLabels should be made configurable (e.g. pass an options flag or read from the schema), or the JSDoc/comment should explicitly note that these are separate paths with separate maintenance requirements.

pelikhan · 2026-07-04T08:55:14Z

@copilot run pr-finisher skill

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

…el stripping Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2026-07-04T09:18:14Z

@copilot run pr-finisher skill

Ran the pr-finisher pass and pushed fixes in f22f7dd and 1dc3cb5.

Local validation is passing (make fmt, make lint, make test-unit, make test, make recompile). CI status for this new head is stale after agent pushes and needs a maintainer-triggered rerun.

Initial plan

1d8fe4c

Copilot AI assigned Copilot and pelikhan Jul 4, 2026

Copilot AI linked an issue Jul 4, 2026 that may be closed by this pull request

Harden issue-intent confidence handling: clearer suffix + lenient validation #43164

Closed

Copilot started work on behalf of pelikhan July 4, 2026 08:06 View session

Copilot AI and others added 2 commits July 4, 2026 08:20

Address review feedback: improve inline comments and JSDoc formatting

ecac8ae

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Fix issue with confidence handling to validate enum correctly~~ Harden issue-intent confidence handling: clearer INTENT suffix + lenient validation Jul 4, 2026

Copilot finished work on behalf of pelikhan July 4, 2026 08:22

Copilot AI requested a review from pelikhan July 4, 2026 08:22

pelikhan marked this pull request as ready for review July 4, 2026 08:27

Copilot AI review requested due to automatic review settings July 4, 2026 08:27

Copilot started reviewing on behalf of pelikhan July 4, 2026 08:28 View session

Copilot AI reviewed Jul 4, 2026

View reviewed changes

Comment thread actions/setup/js/safe_output_type_validator.cjs Outdated

github-actions Bot mentioned this pull request Jul 4, 2026

[aw] No-Op Runs #39849

Open

github-actions Bot approved these changes Jul 4, 2026

View reviewed changes

github-actions Bot reviewed Jul 4, 2026

View reviewed changes

Copilot started work on behalf of pelikhan July 4, 2026 08:55 View session

Copilot AI and others added 3 commits July 4, 2026 08:59

Plan PR finisher pass

bb48632

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Guard x-strip-on-error for required fields and clean issue-intent lab…

f22f7dd

…el stripping Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Polish rationale stripping comment for validator clarity

1dc3cb5

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan July 4, 2026 09:18

pelikhan merged commit 88fa8be into main Jul 4, 2026
37 checks passed

pelikhan deleted the copilot/harden-issue-intent-confidence-handling-again branch July 4, 2026 10:16

Uh oh!

Conversation

Copilot AI commented Jul 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Fix 1 — Clearer INTENT description suffix (generate_safe_outputs_tools.cjs)

Fix 2 — x-strip-on-error lenient validation

Uh oh!

github-actions Bot commented Jul 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jul 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jul 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jul 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Review details

Uh oh!

Uh oh!

github-actions Bot commented Jul 4, 2026

🧪 Test Quality Sentinel Report

Verdict

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Skills-Based Review 🧠

Key Themes

Positive Highlights

Comments that could not be inline-anchored

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Review: Harden issue-intent confidence handling

Uh oh!

github-actions Bot Jul 4, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jul 4, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Two medium-priority maintainability concerns; no blocking correctness bugs found

Uh oh!

github-actions Bot Jul 4, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jul 4, 2026

Choose a reason for hiding this comment

Uh oh!

pelikhan commented Jul 4, 2026

Uh oh!

Copilot AI commented Jul 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Copilot AI commented Jul 4, 2026 •

edited

Loading

Fix 1 — Clearer INTENT description suffix (`generate_safe_outputs_tools.cjs`)

Fix 2 — `x-strip-on-error` lenient validation

github-actions Bot commented Jul 4, 2026 •

edited

Loading

github-actions Bot commented Jul 4, 2026 •

edited

Loading

github-actions Bot commented Jul 4, 2026 •

edited

Loading

github-actions Bot commented Jul 4, 2026 •

edited

Loading