You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Since PR #256, trigger_pr_creation returns { partialFailure: true, failedRepos } instead of silently dropping repos, and the orchestrator prompt handles remediation/escalation. But post-hoc, an operator has nothing to diagnose with:
the failure detail exists only in the tool response (consumed by the LLM) and console.error in the ephemeral ECS task's CloudWatch stream
nothing is persisted to Neptune about WHICH repo failed and WHY
no metric/alarm fires — if the agent mishandles the response, the failure evaporates
Proposal
EMF metric MultiRepoPRPartialFailure (dimensions: projectId, repository, reason: conflict|mergeError|other) emitted from the MCP server
Problem
Since PR #256,
trigger_pr_creationreturns{ partialFailure: true, failedRepos }instead of silently dropping repos, and the orchestrator prompt handles remediation/escalation. But post-hoc, an operator has nothing to diagnose with:console.errorin the ephemeral ECS task's CloudWatch streamProposal
MultiRepoPRPartialFailure(dimensions: projectId, repository, reason: conflict|mergeError|other) emitted from the MCP serverfailed_repos(JSON) on the PRGroup vertex next to theexpected_reposproperty PR feat(multi-repo): complete project repository management UI #256 added, so the UI can render "PR missing for repo X: merge conflict on branch Y"expected_reposvs GROUPS edges — the reconciliation data model already supports this)Refs: PR #256 review finding M6 (#256 (comment))