Testing Strategy
Test Pyramid
Test Execution
Local Testing:Test Markers
unit: Fast, isolated unit testsintegration: Tests requiring external servicesproperty: Property-based tests with Hypothesiscontract: MCP protocol compliance testsregression: Performance regression testsbenchmark: Performance benchmarksmutation: Mutation testing (weekly schedule)
Mutation Testing
Purpose: Measure test effectiveness Schedule: Weekly (too slow for every PR) Command:Benchmark Testing
Purpose: Track performance regressions Execution: Every PR + weekly Thresholds:- Agent response: p95 < 5s
- LLM call: p95 < 10s
- Authorization: p95 < 50ms