Skip to content

docs(eval): v6 results + the pr_review→enforce promotion-criterion ADR #3849

@williamzujkowski

Description

@williamzujkowski

Part of #3845. Depends on #3847, #3848, #3840 (ladder ADR's evidence-threshold schema).

Context

Run the panel over the n≥50 set; publish v6 (per-voter tables, deltas vs v5, honest FP analysis under the rubric). Define the promotion criterion per the Epic D evidence schema (e.g. sustained precision ≥ X over a sliding window of N live advisory reviews + eval ≥ Y) and encode it in the claims registry so claims:check tracks it.

Acceptance criteria

  • v6 doc with per-voter precision/recall
  • Promotion-criterion ADR (schema-conformant; ratification path = Epic D vote)
  • Claims registry: '100% bug-catch (v5, n=10)' superseded by the v6 entry

Evidence required

v6 artifact; registry diff; docs gates.

Out of scope

The promotion itself (a future ratified Epic D transition).

Metadata

Metadata

Assignees

No one assigned

    Labels

    evalEval sets, per-voter precision (Epic E)p2Priority 2 - Medium impact, moderate changes neededroadmap:control-planeControl Plane roadmap (M1-M4)

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions