[doc] add warning about comparing hf and vllm outputs #10805

youkaichao · 2024-12-01T08:06:37Z

A lesson learned from #1069 (comment) .

In our ci, we always use do_sample=False for huggingface models, so it is not a problem.

Signed-off-by: youkaichao <[email protected]>

github-actions · 2024-12-01T08:06:50Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

Signed-off-by: youkaichao <[email protected]>

DarkLight1337 · 2024-12-01T08:32:23Z

Let's merge this first. Later we can implement #10758

youkaichao · 2024-12-01T08:39:45Z

oh i don't know #10758 . Is that what we want to implement? that's kind of bc-breaking.

DarkLight1337 · 2024-12-01T08:41:26Z

I would argue that #10758 is the "intended" behavior. To avoid breaking changes, we can log a warning for a while before finally switching over to using generation_config.json as the default.

youkaichao · 2024-12-01T08:43:59Z

generation_config.json is aligned with huggingface generate function, if we take default values from the file, the implementation will be coupled.

DarkLight1337 · 2024-12-01T08:46:57Z

Let's move the discussion to that issue so we can gather thoughts from the OP as well.

…0805) Signed-off-by: youkaichao <[email protected]> Signed-off-by: Andrew Feldman <[email protected]>

…0805) Signed-off-by: youkaichao <[email protected]> Signed-off-by: cedonley <[email protected]>

polish doc

cf00e89

Signed-off-by: youkaichao <[email protected]>

youkaichao mentioned this pull request Dec 1, 2024

Inconsistent results between HuggingFace Transformers and vllm #1069

Closed

mergify bot added the documentation Improvements or additions to documentation label Dec 1, 2024

update link

94970a6

Signed-off-by: youkaichao <[email protected]>

youkaichao requested a review from DarkLight1337 December 1, 2024 08:24

DarkLight1337 approved these changes Dec 1, 2024

View reviewed changes

DarkLight1337 enabled auto-merge (squash) December 1, 2024 08:32

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 1, 2024

youkaichao disabled auto-merge December 1, 2024 08:41

youkaichao merged commit 169a0ff into vllm-project:main Dec 1, 2024
13 of 20 checks passed

youkaichao deleted the hf_diff branch December 1, 2024 08:41

afeldman-nm pushed a commit to neuralmagic/vllm that referenced this pull request Dec 2, 2024

[doc] add warning about comparing hf and vllm outputs (vllm-project#1…

cf04e11

…0805) Signed-off-by: youkaichao <[email protected]> Signed-off-by: Andrew Feldman <[email protected]>

cedonley pushed a commit to cedonley/vllm that referenced this pull request Dec 7, 2024

[doc] add warning about comparing hf and vllm outputs (vllm-project#1…

a6c0961

…0805) Signed-off-by: youkaichao <[email protected]> Signed-off-by: cedonley <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[doc] add warning about comparing hf and vllm outputs #10805

[doc] add warning about comparing hf and vllm outputs #10805

youkaichao commented Dec 1, 2024 •

edited by github-actions bot

Loading

github-actions bot commented Dec 1, 2024

DarkLight1337 commented Dec 1, 2024

youkaichao commented Dec 1, 2024 •

edited

Loading

DarkLight1337 commented Dec 1, 2024

youkaichao commented Dec 1, 2024

DarkLight1337 commented Dec 1, 2024

[doc] add warning about comparing hf and vllm outputs #10805

[doc] add warning about comparing hf and vllm outputs #10805

Conversation

youkaichao commented Dec 1, 2024 • edited by github-actions bot Loading

github-actions bot commented Dec 1, 2024

DarkLight1337 commented Dec 1, 2024

youkaichao commented Dec 1, 2024 • edited Loading

DarkLight1337 commented Dec 1, 2024

youkaichao commented Dec 1, 2024

DarkLight1337 commented Dec 1, 2024

youkaichao commented Dec 1, 2024 •

edited by github-actions bot

Loading

youkaichao commented Dec 1, 2024 •

edited

Loading