MCP Mastery

Eval Writing Labs

Four offline arcs: rubric scoring, pairwise preference, slice regressions, and rollout gates. Each is Python-only, runnable with uv or pytest, and test-backed so the happy path cannot cosplay as evidence.
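
As a rough illustration of the test-backed style, here is a minimal sketch that ties a slice regression check to a rollout gate. It is not taken from the labs themselves: `SliceResult`, `regressed_slices`, `rollout_gate`, the 0.02 tolerance, and the slice names are all hypothetical, chosen only to show the shape of such a check.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SliceResult:
    """Pass rates for one data slice (hypothetical structure)."""
    name: str
    baseline: float   # pass rate of the reference model
    candidate: float  # pass rate of the model under review


def regressed_slices(results, tolerance=0.02):
    """Return names of slices where the candidate drops by more than `tolerance`."""
    return [r.name for r in results if r.baseline - r.candidate > tolerance]


def rollout_gate(results, tolerance=0.02):
    """Allow rollout only if no slice regresses beyond the tolerance."""
    return not regressed_slices(results, tolerance)


def test_gate_blocks_on_slice_regression():
    # One slice improves while another regresses: the gate must still block.
    results = [
        SliceResult("short_prompts", baseline=0.80, candidate=0.90),
        SliceResult("long_prompts", baseline=0.70, candidate=0.60),
    ]
    assert regressed_slices(results) == ["long_prompts"]
    assert rollout_gate(results) is False
```

Saved to a file such as `test_gate.py` (an assumed name), this would be collected by pytest. The test deliberately exercises a failing slice rather than only the happy path, so a gate that always returns `True` would not pass.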