Skip to content

Conversation

@kansalaman
Copy link
Contributor

@kansalaman kansalaman commented Jan 23, 2026

Summary

Changes

  • Remove NotImplementedError check for tools in VLLMClient.chat()
  • Add tools parameter to HTTP request payload in client
  • Add tools field to ChatRequest model in server
  • Pass tools to vLLM's chat() method on server side

Test plan

  • Existing vLLM tests pass (pytest tests/ -k "vllm" -v)
  • Manual testing with vLLM server and tool calling

Fixes huggingface#4871

Previously, using GRPOTrainer with `vllm_mode="server"` raised a
`NotImplementedError` when tools were passed to `VLLMClient.chat()`.
This prevented users from using tool calling features with the vLLM
server mode.

Changes:
- Remove the NotImplementedError check in VLLMClient.chat()
- Add `tools` parameter to the HTTP request payload
- Add `tools` field to ChatRequest model in vllm_serve.py
- Pass tools to vLLM's chat() method on the server side
@kansalaman kansalaman force-pushed the fix-vllm-tool-calling branch from 2eeebac to e6b226a Compare January 23, 2026 07:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

NotImplementedError: Tool calling is not yet implemented in VLLMClient.chat().

1 participant