refactor(sdk): extract LLM response classification and dispatch from Agent.step() by VascoSch92 · Pull Request #2743 · OpenHands/software-agent-sdk

VascoSch92 · 2026-04-07T14:25:45Z

Summary

Extracts the monolithic LLM response handling block from Agent.step() into a pure classifier function and a dispatch mixin, following the existing CriticMixin pattern.

What changed

New file response_dispatch.py containing:
LLMResponseType (StrEnum) — classifies responses into TOOL_CALLS, CONTENT, REASONING_ONLY, EMPTY
classify_response() — pure function, no side effects, unit-testable without mocking the agent
_AgentProtocol — typed Protocol documenting the mixin's contract with its host class
ResponseDispatchMixin — handler methods for each response type, mixed into Agent
agent.py — the ~85-line if/else chain in step() replaced by a 10-line match dispatch. Agent now inherits from ResponseDispatchMixin. Net reduction of ~70 lines.
New test file test_response_dispatch.py — 15 parametrized classifier tests covering all 8 rows of the response matrix + edge cases, plus 3 mixin integration tests verifying dispatch behavior (FINISHED status, corrective nudge).

What did NOT change

All existing behavior is preserved exactly. No new features, no bug fixes — this is a pure structural refactoring.

Motivation

classify_response() is independently testable with zero mocks (previously required full agent loop)
Each response type has a dedicated handler, so adding new types (e.g. reasoning promotion for #2482, "[Bug] Remote conversation got stuck: monologue detector false positive on extended thinking models") is additive
_AgentProtocol makes the mixin's dependencies explicit instead of relying on duck typing
step() is now ~15 lines of dispatch instead of ~85 lines of branching

Checklist

If the PR is changing/adding functionality, are there tests to reflect this?
If there is an example, have you run the example to make sure that it works?
If there are instructions on how to run the code, have you followed the instructions and made sure that it works?
If the feature is significant enough to require documentation, is there a PR open on the OpenHands/docs repository with the same branch name?
Is the github CI passing?

Agent Server images for this PR

• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server

Variants & Base Images

Variant	Architectures	Base Image	Docs / Tags
java	amd64, arm64	`eclipse-temurin:17-jdk`	Link
python	amd64, arm64	`nikolaik/python-nodejs:python3.13-nodejs22-slim`	Link
golang	amd64, arm64	`golang:1.21-bookworm`	Link

Pull (multi-arch manifest)

# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:95e3a6b-python

Run

docker run -it --rm \
  -p 8000:8000 \
  --name agent-server-95e3a6b-python \
  ghcr.io/openhands/agent-server:95e3a6b-python

All tags pushed for this build

ghcr.io/openhands/agent-server:95e3a6b-golang-amd64
ghcr.io/openhands/agent-server:95e3a6b-golang_tag_1.21-bookworm-amd64
ghcr.io/openhands/agent-server:95e3a6b-golang-arm64
ghcr.io/openhands/agent-server:95e3a6b-golang_tag_1.21-bookworm-arm64
ghcr.io/openhands/agent-server:95e3a6b-java-amd64
ghcr.io/openhands/agent-server:95e3a6b-eclipse-temurin_tag_17-jdk-amd64
ghcr.io/openhands/agent-server:95e3a6b-java-arm64
ghcr.io/openhands/agent-server:95e3a6b-eclipse-temurin_tag_17-jdk-arm64
ghcr.io/openhands/agent-server:95e3a6b-python-amd64
ghcr.io/openhands/agent-server:95e3a6b-nikolaik_s_python-nodejs_tag_python3.13-nodejs22-slim-amd64
ghcr.io/openhands/agent-server:95e3a6b-python-arm64
ghcr.io/openhands/agent-server:95e3a6b-nikolaik_s_python-nodejs_tag_python3.13-nodejs22-slim-arm64
ghcr.io/openhands/agent-server:95e3a6b-golang
ghcr.io/openhands/agent-server:95e3a6b-java
ghcr.io/openhands/agent-server:95e3a6b-python

About Multi-Architecture Support

Each variant tag (e.g., 95e3a6b-python) is a multi-arch manifest supporting both amd64 and arm64
Docker automatically pulls the correct architecture for your platform
Individual architecture tags (e.g., 95e3a6b-python-amd64) are also available if needed

github-actions · 2026-04-07T14:26:25Z

Python API breakage checks — ✅ PASSED

Result: ✅ PASSED

Action log

github-actions · 2026-04-07T14:26:29Z

REST API breakage checks (OpenAPI) — ✅ PASSED

Result: ✅ PASSED

Action log

github-actions · 2026-04-07T14:28:20Z

Coverage Report

File	Stmts	Miss	Cover	Missing
openhands-sdk/openhands/sdk/agent
agent.py	299	20	93%	99, 280, 284, 483–485, 487, 517–518, 525–526, 614, 879–880, 882, 911, 919–920, 954, 961
response_dispatch.py	64	4	93%	190, 275–277
TOTAL	22584	6467	71%

all-hands-bot

🟢 Good taste - Textbook refactoring. Reduces 85 lines of branching to 10 lines of clean dispatch. Pure classifier is independently testable. Tests are real (not mocks). No technical issues found.

Note: Per repo policy, agent loop changes require human maintainer eval verification before merge (previously flagged and acknowledged).

all-hands-bot · 2026-04-13T12:41:56Z

[Automatic Post]: It has been a while since there was any activity on this PR. @VascoSch92, are you still working on it? If so, please go ahead; if not, please request review, close it, or ask someone else to follow up.

enyst · 2026-04-13T20:59:07Z

@OpenHands /codereview-roasted on this pr, post your feedback as a review with gh api and event.

openhands-ai · 2026-04-13T20:59:23Z

I'm on it! enyst can track my progress at all-hands.dev

enyst

🟢 Taste Rating: Good taste

This is the right kind of refactor. The data flow is simpler now: classify once, dispatch once, keep the behavior in small handlers instead of burying it in one long branch pile. No new special cases, no API churn, and the tests exercise the real Agent.step() path instead of only asserting that mocks were called.

I reviewed the diff against main, checked the new classifier/dispatch split, and ran:

uv run pytest tests/sdk/agent/test_response_dispatch.py

Result: 19 passed.

Because this change touches Agent.step() and response/tool-dispatch control flow, I'm leaving a COMMENT review rather than an approval; per repo policy, a human maintainer should make the final call after lightweight eval/CI confidence.

VERDICT: ✅ Worth merging pending human eval-risk sign-off

KEY INSIGHT: The refactor improves the structure by making response classification the single decision point, while keeping behavior local to dedicated handlers and preserving the existing control flow.

AI disclosure: This review was generated by an AI assistant (OpenHands) on behalf of the user.

openhands-ai · 2026-04-13T21:01:41Z

Final summary of new work since the last summary:

I completed the requested PR review workflow for PR refactor(sdk): extract LLM response classification and dispatch from Agent.step() #2743.
I fetched and reviewed the PR metadata, body, diff, and linked-issue context.
I inspected the changed files:
- openhands-sdk/openhands/sdk/agent/agent.py
- openhands-sdk/openhands/sdk/agent/response_dispatch.py
- tests/sdk/agent/test_response_dispatch.py
I ran the targeted test suite:
- uv run pytest tests/sdk/agent/test_response_dispatch.py
- Result: 19 passed
I posted the requested GitHub review as a COMMENT review via gh api:
- refactor(sdk): extract LLM response classification and dispatch from Agent.step() #2743 (review)

Checklist:

Reviewed the PR in the requested /codereview-roasted style
Used GitHub API / gh api to post the feedback as a review with COMMENT event
Verified the relevant targeted tests pass
Made no code changes

Conciseness:

No repository files were modified
No extraneous changes were introduced

Status:

The request was completely addressed
All instructions were followed faithfully

VascoSch92 · 2026-04-14T11:09:02Z

@enyst I think we are good to merge or? :-)

refactoring

3523ae0

VascoSch92 requested a review from all-hands-bot April 7, 2026 14:25

address comments

1840900

VascoSch92 requested a review from all-hands-bot April 7, 2026 15:24

feedbacks

068201f

VascoSch92 requested a review from all-hands-bot April 7, 2026 16:05

all-hands-bot reviewed Apr 7, 2026

VascoSch92 marked this pull request as ready for review April 7, 2026 16:18

all-hands-bot reviewed Apr 7, 2026

Merge branch 'main' into vasco/refactoring

95e3a6b

enyst reviewed Apr 13, 2026

3 participants
