Plan. Work. Review. Ship.
A disciplined delivery loop for Claude Code, with bounded paths for Codex and OpenCode.
English | 日本語
Claude Code is powerful, but raw agent work drifts: plans live in chat, tests become optional, review happens too late, and release evidence gets rebuilt by memory. Harness turns that into one repeatable operating path.
After install, the default changes from "ask the agent to code" to:
- write the spec and plan,
- implement only the approved slice,
- verify the result,
- review independently,
- package evidence for PR or release.
New users should start from the tool they already use. Existing users should run the migration report before cleanup or reinstall.
| Path | Start |
|---|---|
| New user | Tool-first onboarding |
| Existing user | Migration check |
| Claude Code fast path | Install in 30 seconds |
| Trigger proof | Skill trigger gate |
claude
/plugin marketplace add Chachamaru127/claude-code-harness
/plugin install claude-code-harness@claude-code-harness-marketplace
/harness-setupNext command: run /harness-plan with one small request.
/harness-plan Improve the README onboarding flow- Install through your tool route.
- Run
/harness-setupor the equivalent setup script. - Run
/harness-planwith a small request; Harness writes thespec.mdandPlans.mddrafts for you to check. Small typo, docs, and status updates stay lightweight. - Approve the generated contract or reply with the correction you want.
- Run the smallest approved task, for example
/harness-work 1.1.1. - Run
/harness-reviewand keep the verification output.
Your job is not to hand-write the plan. It is to approve or correct the generated contract before execution continues.
Harness adds a source-of-truth loop around agent work. The 5 verb skills keep that surface small: plan, work, review, sync, release.
- You describe the outcome in normal language.
/harness-plandrafts or updatesspec.mdandPlans.mdwith scope, acceptance criteria, unknowns, and stop conditions.- Non-trivial planning records
team_validation_modeand validates the plan through team/sub-agent or manual-pass perspectives for spec/Plans alignment, memory reuse, product fit, security fit, and works-in-practice. - Harness treats those files as the source of truth. Data the agent has not
seen stays
unknowninstead of being silently invented. /harness-workimplements the approved slice with TDD and verification./harness-reviewseparates review from implementation./harness-releasepackages only verified evidence.
| Command | What happens inside |
|---|---|
/harness-setup |
Installs project guidance, command surfaces, hooks, and checks so the workflow starts from one known baseline. |
/harness-plan |
Turns intent into spec.md and Plans.md, including scope, acceptance criteria, dependencies, unknowns, stop conditions, and non-trivial planning validation. |
/harness-work |
Executes one approved task or range, adds tests when required, runs verification, and keeps work inside the plan. |
/harness-work all |
Runs the approved plan through implementation and review paths; use after the plan is clear and the repo baseline is known. |
/harness-review |
Reviews the result separately from implementation and treats major findings as blockers. |
/harness-release |
Checks release readiness, CHANGELOG/tag boundaries, and evidence packaging after implementation and review are complete. |
bin/harness doctor --migration-report |
Inventories old plugin caches, Codex skills, OpenCode files, symlinks, and memory state without deleting data. |
| Stage | Output | Gate |
|---|---|---|
| Investigate | Evidence and unknowns | Do not promote unobserved data into claims. |
| Plan | spec.md + Plans.md |
User approves or corrects the generated contract. |
| Work | Code and tests | TDD required when the task says so. |
| Review | Independent verdict | Major findings block completion. |
| PR | Evidence pack | PR ready is not release ready. |
| Release | Tag/release artifacts | Release preflight must pass on the release path. |
| Tool | Tier | Route |
|---|---|---|
| Claude Code | supported |
Claude plugin marketplace, then /harness-setup. |
| Codex CLI | internal-compatible |
scripts/setup-codex.sh --user; direct plugin smoke is tracked separately. |
| Codex app | candidate |
Candidate smoke only; do not reuse Codex CLI proof. |
| OpenCode | internal-compatible |
scripts/setup-opencode.sh; runtime parity is not claimed. |
| Cursor | internal-compatible |
scripts/setup-cursor.sh real-directory local install; top support tier still gated on workflow smoke. |
| GitHub Copilot CLI | candidate |
Manual profile research only. |
| Antigravity CLI | future/unsupported |
No end-user install route in this phase. |
Run bin/harness doctor --migration-report before changing an existing setup.
The report inventories stale Claude plugin caches, duplicate Codex skills, old
symlinks, OpenCode backup paths, and harness-mem state without deleting
anything.
Harness can describe candidate paths, but it does not inherit support claims from Superpowers, Hermes Agent, or any other project. A host only moves up when Harness has its own bootstrap, trigger, runtime, and release evidence.
not_observed != absent: missing local proof means "not proven here", not
"impossible" and not "supported".
- Claude Code v2.1+ for the supported Claude path.
- A project repository with write access for local setup.
- No Node.js is required for the Go-native guardrail engine.
- Optional harness-mem for cross-session memory when configured and healthy.
Use these after the basic trigger path is visible.
| Capability | What it adds | Boundary |
|---|---|---|
| Breezing | Planner/Critic/Worker style team execution for larger task lists. | Still gated by plan quality and review. |
| Codex companion review | Schema-backed Codex second opinion through scripts/codex-companion.sh. |
Raw codex exec is not the Harness companion path. |
| OpenCode bootstrap | Mirrors Harness guidance into OpenCode-compatible surfaces. | Real runtime parity is not claimed. |
| harness-mem | Project-scoped memory and recall across sessions. | Optional companion; purge remains explicit. |
| Resource | Description |
|---|---|
| Tool-first onboarding | Where to start by host tool. |
| Install routes | Per-tool setup and support-tier boundaries. |
| Migration check | Existing-user impact, compatibility, and rollback path. |
| Skill trigger gate | How install success is verified. |
| Capability matrix | Supported, internal-compatible, candidate, and unsupported host claims. |
| Claude Code Compatibility | Current Claude Code requirements and compatibility notes. |
| Cursor Integration | Cursor handoff boundary and internal-compatible adapter notes. |
| Distribution Scope | Included vs compatibility vs development-only paths. |
| Hardening parity | Runtime safety differences between Claude hooks and Codex gates. |
| Work All Evidence Pack | Success/failure verification contract for full-plan execution. |
| Changelog | User-facing version history. |
Issues and PRs welcome. See CONTRIBUTING.md.
MIT License. See LICENSE.md.

