- Category: Testing / AI DevTools
- Language: Go
- Date: 2026-03-28
- Tests: 134
- Status: Build complete
YAML-driven behavioral testing framework for AI agents. Verifies tool calls, sequences, and constraints against execution logs. Decoupled from LLMs -- reads JSON execution logs, not API responses.
internal/spec/-- YAML parser, types, validatorinternal/matcher/-- 5 matching modes (exact, contains, regex, schema, any)internal/runner/-- Sequential and parallel executioninternal/reporter/-- Table and JSON outputinternal/loader/-- Execution log loadercmd/agentspec/-- CLI (cobra)
- Decoupled from LLMs -- Tests run against execution logs, not live agents. This makes tests deterministic and fast.
- YAML specs -- Familiar format for defining expected behavior. Supports nested matchers.
- 5 matching modes -- From strict (exact) to permissive (any), covering real-world needs.
- Constraint system --
no_tool,max_tools,orderedlet you define what agents should NOT do. - Auto log discovery --
spec.yamlautomatically pairs withspec.log.json.