
Conversation

@gadenbuie (Collaborator) commented Nov 6, 2025

Fixes #459 by completing incomplete tool requests with an empty tool result.

The implementation focuses on dangling tool requests only. In other words, when calling $chat(), $stream(), or a related async method on Chat, we check whether the last message is an assistant message with unanswered tool requests; if so, we add a new user turn containing the empty tool results before processing the new input.
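
A minimal sketch of that check is below. The accessor and helper names are illustrative only; in the PR this lives in a private Chat method, complete_dangling_tool_requests().

dangling_tool_requests <- function(turns) {
  if (length(turns) == 0) return(list())

  last_turn <- turns[[length(turns)]]
  if (last_turn@role != "assistant") return(list())

  # Tool requests still sitting in the final assistant turn have no matching
  # tool result, so they are the dangling requests.
  Filter(
    function(content) inherits(content, "ellmer::ContentToolRequest"),
    last_turn@contents
  )
}

empty_tool_results <- function(tool_requests) {
  # Answer each dangling request with an error result so the provider will
  # accept the next user message.
  lapply(tool_requests, function(req) {
    ContentToolResult(
      error = "Chat ended before the tool could be invoked.",
      request = req
    )
  })
}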

Here's a simple example with a tool to get a random number:

pkgload::load_all()
#> ℹ Loading ellmer

random_number_tool <- tool(
  function(n) sample(1:100, n),
  name = "pick_random_number",
  description = "Pick a random number between 1 and 100.",
  arguments = list(
    n = type_number("Number of random numbers to pick.")
  )
)

chat <- chat_openai(model = "gpt-4.1-nano")
# chat <- chat_anthropic(model = "claude-haiku-4-5-20251001")
chat$register_tool(random_number_tool)

chat$chat("Pick a single random number for me.")
#> ◯ [tool call] pick_random_number(n = 1L)
#> ● #> 95
#> I picked the number 95 for you.

If we pretend the chat was truncated, i.e. the user interrupted the action before we could send the tool result back to the LLM, the turns would look something like this:

chat$set_turns(chat$get_turns()[1:2]) # Keep user input and tool request
chat$get_turns()
#> [[1]]
#> <Turn: user>
#> Pick a single random number for me.
#> 
#> [[2]]
#> <Turn: assistant>
#> [tool request (call_EdrkWPAmiN5KbaGSghAUpuwo)]: pick_random_number(n = 1L)

Currently, trying to pick up the conversation at this point would result in an API error because the tool request wasn't completed. With this PR, however, the chat can be continued normally.

chat$chat("Try again")
#> ◯ [tool call] pick_random_number(n = 1L)
#> ● #> 52
#> The random number I picked for you is 52.

Inspecting the chat shows the new user turn with the empty tool result telling the LLM that the chat was interrupted and the tool wasn't invoked.

chat
#> <Chat OpenAI/gpt-4.1-nano turns=7 input=308 output=42 cost=$0.00>
#> ── user ────────────────────────────────────────────────────────────────────────
#> Pick a single random number for me.
#> ── assistant [input=65 output=15 cost=$0.00] ───────────────────────────────────
#> [tool request (call_EdrkWPAmiN5KbaGSghAUpuwo)]: pick_random_number(n = 1L)
#> ── user ────────────────────────────────────────────────────────────────────────
#> [tool result  (call_EdrkWPAmiN5KbaGSghAUpuwo)]: Error: Chat ended before the tool could be invoked.
#> ── user ────────────────────────────────────────────────────────────────────────
#> Try again
#> ── assistant [input=109 output=15 cost=$0.00] ──────────────────────────────────
#> [tool request (call_9qXCmh4p8QMwLi0TzXaeRrHe)]: pick_random_number(n = 1L)
#> ── user ────────────────────────────────────────────────────────────────────────
#> [tool result  (call_9qXCmh4p8QMwLi0TzXaeRrHe)]: 52
#> ── assistant [input=134 output=12 cost=$0.00] ──────────────────────────────────
#> The random number I picked for you is 52.

Note that we insert a separate user message primarily because of the bug discussed in #735: inserting tool results into the new user message hits the incorrect behavior described in the linked comment and doesn't work with OpenAI (and a few other providers). I also think it's conceptually better to store this as a separate user message, but it wouldn't be hard to inject the tool results into the actual new user message once #735 lands.
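
For comparison, the two shapes of history under discussion (illustrative only, reusing the example above):

# Separate user turn carrying the error tool result (this PR):
#   user       Pick a single random number for me.
#   assistant  [tool request]  pick_random_number(n = 1L)
#   user       [tool result]   Error: Chat ended before the tool could be invoked.
#   user       Try again
#
# Tool results folded into the new user turn (an option once #735 lands):
#   user       Pick a single random number for me.
#   assistant  [tool request]  pick_random_number(n = 1L)
#   user       [tool result]   Error: ... + Try again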

TODO

  • Add tests

@gadenbuie gadenbuie requested a review from hadley November 6, 2025 18:20
@gadenbuie (Collaborator, Author)

@hadley I requested a review to see if you have initial thoughts and reactions. We could tackle this separately or in combination with #735, and I still need to add tests.

@hadley (Member) left a comment

This approach of providing a generic "tool request failed" result seems perfect to me!

R/chat.R Outdated
Comment on lines 767 to 776
tool_results <- lapply(tool_requests, function(req) {
  ContentToolResult(
    error = "Chat ended before the tool could be invoked.",
    request = req
  )
})
self$add_turn(
  tool_results_as_turn(tool_results),
  AssistantTurn("Acknowledged", tokens = c(0, 0, 0))
)
@gadenbuie (Collaborator, Author)

I'm in favor of relaxing the user-assistant paired turns constraint, but I also understand that you probably don't want to do that in this release and that it's better to stay internally consistent.

I think this approach is clean and simple except for the faked assistant turn. My initial implementation, which I'd recommend over adding an assistant turn, was to attach the tool results to the incoming user message.

That approach introduces a dependency on #735 (in the sense that it fixes a bug with tool results and text content being sent out of order), but that PR is almost done. I'll add a commit that takes this PR in that direction so you can see how that feels (and we can revert it if you prefer this approach).
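
Roughly, that earlier direction looks like the sketch below (illustrative: the @contents accessor and ordering are assumptions, and it depends on #735 so that tool results and text are serialized in the right order):

turn <- user_turn(...)

# Build error tool results for the dangling requests, then prepend them to the
# incoming user turn instead of adding a separate user/assistant turn pair.
tool_results <- lapply(tool_requests, function(req) {
  ContentToolResult(
    error = "Chat ended before the tool could be invoked.",
    request = req
  )
})
turn@contents <- c(tool_results, turn@contents)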

@gadenbuie (Collaborator, Author)

Here's the commit: 4037506. I consolidated the tests into one that uses vcr and updated that cassette in a follow-up: 70313b1

Comment on lines 816 to +818
  out <- paste0(prefix, "input=")

- if (tokens[[3]] > 0) {
+ if (!is.na(tokens[[3]]) && tokens[[3]] > 0) {
@gadenbuie (Collaborator, Author)

Small note here, but now that tokens are structured, could this code use names instead of indices? Also, very minor nit, but I think res would be better than out; on my first read I thought out was for output tokens.
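
For illustration, the kind of change I mean, assuming the tokens vector carried names such as input/output/cached_input (hypothetical; it doesn't today):

tokens <- c(input = 65, output = 15, cached_input = 0)
if (!is.na(tokens[["cached_input"]]) && tokens[["cached_input"]] > 0) {
  # ... append the cached-input count to the printed summary ...
}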

@hadley (Member)

Tokens aren't structured in Turn objects yet 😬

  #' will be used.
  chat = function(..., echo = NULL) {
    turn <- user_turn(...)
+   finish_tools <- private$complete_dangling_tool_requests()
@hadley (Member)

This is much better!

@hadley hadley merged commit 8d794a6 into main Nov 13, 2025
11 checks passed
@hadley hadley deleted the feat/dangling-tool-requests branch November 13, 2025 21:29