feat(core): repair malformed llm grader output by christso · Pull Request #933 · EntityProcess/agentv

christso · 2026-04-04T09:24:02Z

Closes #911

Summary

add a final structure-repair retry for llm-grader after the 3 standard attempts fail on malformed structured output
reuse the last invalid grader response plus validation error instead of re-grading from scratch
skip the repair path when the grader returned no content to salvage
document the AgentV OSS board claim workflow fix in AGENTS.md so missing project items are added before status updates

Verification

bun test packages/core/test/evaluation/evaluators.test.ts packages/core/test/evaluation/evaluators_variables.test.ts packages/core/test/evaluation/orchestrator.test.ts
pre-push hook passed: build, typecheck, lint, test, validate eval YAML files

Red/Green UAT

Red on main:

bun apps/cli/src/cli.ts eval /tmp/agentv-911-redgreen/repair.eval.yaml --target candidate_mock --output /tmp/agentv-911-redgreen/main.red.jsonl
result: repair-check was skipped with Grader parse failure after 3 attempts, and /tmp/agentv-911-redgreen/main.red.jsonl recorded execution_status: execution_error with score: 0

Green on this branch:

bun apps/cli/dist/cli.js eval /tmp/agentv-911-redgreen/repair.eval.yaml --target candidate_mock --output /tmp/agentv-911-redgreen/branch.green.jsonl
result: the same eval passed with score: 1 after the grader script received the structure-repair prompt, and /tmp/agentv-911-redgreen/branch.green.jsonl recorded execution_status: ok

cloudflare-workers-and-pages · 2026-04-04T09:24:32Z

Deploying agentv with Cloudflare Pages

Latest commit:	`321f3e7`
Status:	✅ Deploy successful!
Preview URL:	https://a95c6d91.agentv.pages.dev
Branch Preview URL:	https://feat-911-smart-llm-grader-re.agentv.pages.dev

View logs

christso added 4 commits April 4, 2026 09:19

feat(core): repair malformed llm grader output

7a040d6

feat(core): add llm grader structure repair retry

6346f0a

style(test): format llm grader retry test

ed0a102

fix(core): skip structure repair when grader returns no content

321f3e7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(core): repair malformed llm grader output#933

feat(core): repair malformed llm grader output#933
christso wants to merge 4 commits intomainfrom
feat/911-smart-llm-grader-retry

christso commented Apr 4, 2026

Uh oh!

cloudflare-workers-and-pages bot commented Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

christso commented Apr 4, 2026

Summary

Verification

Red/Green UAT

Uh oh!

cloudflare-workers-and-pages bot commented Apr 4, 2026

Deploying agentv with Cloudflare Pages

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant