fix(vLLM): Add tool calling support to VLLMClient.chat() #4889

kansalaman · 2026-01-23T07:28:12Z

Summary

Fixes NotImplementedError: Tool calling is not yet implemented in VLLMClient.chat(). #4871
Adds tool calling support to VLLMClient.chat() for vLLM server mode

Changes

Remove NotImplementedError check for tools in VLLMClient.chat()
Add tools parameter to HTTP request payload in client
Add tools field to ChatRequest model in server
Pass tools to vLLM's chat() method on server side

Test plan

Existing vLLM tests pass (pytest tests/ -k "vllm" -v)
Manual testing with vLLM server and tool calling

Fixes huggingface#4871 Previously, using GRPOTrainer with `vllm_mode="server"` raised a `NotImplementedError` when tools were passed to `VLLMClient.chat()`. This prevented users from using tool calling features with the vLLM server mode. Changes: - Remove the NotImplementedError check in VLLMClient.chat() - Add `tools` parameter to the HTTP request payload - Add `tools` field to ChatRequest model in vllm_serve.py - Pass tools to vLLM's chat() method on the server side

kansalaman force-pushed the fix-vllm-tool-calling branch from 2eeebac to e6b226a Compare January 23, 2026 07:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(vLLM): Add tool calling support to VLLMClient.chat() #4889

fix(vLLM): Add tool calling support to VLLMClient.chat() #4889

Uh oh!

kansalaman commented Jan 23, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fix(vLLM): Add tool calling support to VLLMClient.chat() #4889

Are you sure you want to change the base?

fix(vLLM): Add tool calling support to VLLMClient.chat() #4889

Uh oh!

Conversation

kansalaman commented Jan 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

kansalaman commented Jan 23, 2026 •

edited

Loading