Context
SPEC-03's test scoring includes test_score *= (1.0 + coverage_delta * 0.5) to reward mutations that increase code coverage.
Current Behavior
discover_eval_config has a coverage_command field but it's never populated or used. Test scoring is binary: all pass = 1.0, any fail = 0.0.
Expected Behavior
Capture baseline coverage percentage, run coverage after mutation, compute delta, and apply the multiplier to the test score.
Impact
- No incentive for mutations to improve coverage
- Mutations that add tests without improving coverage score the same as those that do
References
atlas-specs/03-EVALUATION.md — coverage delta formula
atlas/evaluation/discovery.py — coverage_command field (always None)
atlas/evaluation/dimensions.py — score_tests has no coverage logic
Context
SPEC-03's test scoring includes
test_score *= (1.0 + coverage_delta * 0.5)to reward mutations that increase code coverage.Current Behavior
discover_eval_confighas acoverage_commandfield but it's never populated or used. Test scoring is binary: all pass = 1.0, any fail = 0.0.Expected Behavior
Capture baseline coverage percentage, run coverage after mutation, compute delta, and apply the multiplier to the test score.
Impact
References
atlas-specs/03-EVALUATION.md— coverage delta formulaatlas/evaluation/discovery.py—coverage_commandfield (always None)atlas/evaluation/dimensions.py—score_testshas no coverage logic