Skip to content

fix(eval): ground LLM judge with command reference to prevent false negatives#712

Merged
BYK merged 6 commits intomainfrom
fix/eval-skill-judge-context
Apr 10, 2026
Merged

fix(eval): ground LLM judge with command reference to prevent false negatives#712
BYK merged 6 commits intomainfrom
fix/eval-skill-judge-context

Commits

Commits on Apr 10, 2026