Deterministic evaluation case for one Jidoka turn.
Eval cases are ordinary data: an agent spec, a turn request, and lightweight assertions that can run against fake or live capabilities supplied by the caller.
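As an illustration of "eval cases are ordinary data", the sketch below models a case as a plain record and runs it against a caller-supplied fake capability. It is not Jidoka's actual API: `EvalCase`, `run_case`, `fake_model`, and the `"model"` / `"input"` keys are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

# Hypothetical aliases: a capability maps a prompt to a reply,
# and an assertion inspects the reply and returns pass/fail.
Capability = Callable[[str], str]
Assertion = Callable[[str], bool]


@dataclass(frozen=True)
class EvalCase:
    """One evaluation case: plain data, no behaviour of its own."""
    agent_spec: Dict[str, Any]
    turn_request: Dict[str, Any]
    assertions: List[Assertion] = field(default_factory=list)


def run_case(case: EvalCase, capabilities: Dict[str, Capability]) -> List[bool]:
    """Run a single turn with the caller's capabilities and check each assertion."""
    model = capabilities["model"]  # assumed key; fake or live, the caller decides
    output = model(case.turn_request["input"])
    return [check(output) for check in case.assertions]


def fake_model(prompt: str) -> str:
    """Deterministic stand-in for a live model."""
    return f"echo: {prompt}"


case = EvalCase(
    agent_spec={"name": "greeter"},
    turn_request={"input": "hello"},
    assertions=[
        lambda out: "hello" in out,
        lambda out: out.startswith("echo"),
    ],
)

results = run_case(case, {"model": fake_model})
print(results)
```

Because the capability is passed in and never looked up globally, the same case can be replayed against a live backend by swapping the dictionary entry, while the fake keeps the run deterministic.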