Does Dify have plans to support rerank and embedding models launched by vLLM? #11857

massif-01 · 2024-12-19T14:43:11Z

Self Checks

I have searched for existing issues search for existing issues, including closed ones.
I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
[FOR CHINESE USERS] 请务必使用英文提交 Issue，否则会被关闭。谢谢！:）
Please do not modify this template :) and fill in all the required fields.

1. Is this request related to a challenge you're experiencing? Tell me about your story.

Does Dify have plans to support rerank and embedding models launched by vLLM?
When deploying large models and Dify locally on performance-limited devices (such as Orin X or a Mac mini with 64GB of RAM), using Xinference to launch the models can consume precious memory. Using VLLM to launch various models together is a better choice.

2. Additional context or comments

No response

3. Can you help us with this feature?

I am interested in contributing to this feature.

HiddenPeak · 2024-12-19T14:45:52Z

+1

crazywoola · 2024-12-20T01:24:36Z

Because of this #11588, we no longer accept the prs related to model runtimes.
We are going to launch v1.0 in the near future. After that, I think the community could help with that.

HiddenPeak · 2024-12-20T20:04:45Z

vLLM rerank task api :

Route: /score, Methods: POST
Route: /v1/score, Methods: POST
I try to modify the route /v1/score to /v1/rerank
but dify show me the notice paramate error

waiting for your v1.0 😭

dosubot · 2025-01-22T16:00:50Z

Hi, @massif-01. I'm Dosu, and I'm helping the Dify team manage their backlog. I'm marking this issue as stale.

Issue Summary

You inquired about Dify's plans to support rerank and embedding models from vLLM for better memory efficiency.
HiddenPeak supported your inquiry and provided details about vLLM's rerank task API, noting a parameter error when modifying the route.
Crazywoola mentioned that due to a related issue, pull requests for model runtimes are not being accepted, but community help might be possible post Dify v1.0 launch.
HiddenPeak expressed anticipation for the Dify v1.0 release.

Next Steps

Please let us know if this issue is still relevant to the latest version of the Dify repository by commenting here.
If there is no further activity, this issue will be automatically closed in 15 days.

Thank you for your understanding and contribution!

hundredwz · 2025-02-17T02:37:43Z

vLLM rerank task api :

Route: /score, Methods: POST

Route: /v1/score, Methods: POST
I try to modify the route /v1/score to /v1/rerank
but dify show me the notice paramate error

waiting for your v1.0 😭

vLLM 0.7.2 now support /v1/rerank.

However, we will face 404 error when trying to add rerank model to dify, since there is a bug in rerank.py

we could modify the code in L67 to following

 data = {"model": model_name, "query": query, "documents": docs, "return_documents": True}
  if top_n is not None:
      data["top_n"] = top_n

If premitted, I could submit pr to fix it.

dosubot bot added the 🙋‍♂️ question This issue does not contain proper reproduce steps or it only has limited words without details. label Dec 19, 2024

dosubot bot added the stale Issue has not had recent activity or appears to be solved. Stale issues will be automatically closed label Jan 22, 2025

dosubot bot closed this as not planned Won't fix, can't repro, duplicate, stale Feb 6, 2025

dosubot bot removed the stale Issue has not had recent activity or appears to be solved. Stale issues will be automatically closed label Feb 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does Dify have plans to support rerank and embedding models launched by vLLM? #11857

Does Dify have plans to support rerank and embedding models launched by vLLM? #11857

massif-01 commented Dec 19, 2024

HiddenPeak commented Dec 19, 2024

crazywoola commented Dec 20, 2024

HiddenPeak commented Dec 20, 2024

dosubot bot commented Jan 22, 2025

hundredwz commented Feb 17, 2025 •

edited

Loading

Does Dify have plans to support rerank and embedding models launched by vLLM? #11857

Does Dify have plans to support rerank and embedding models launched by vLLM? #11857

Comments

massif-01 commented Dec 19, 2024

Self Checks

1. Is this request related to a challenge you're experiencing? Tell me about your story.

2. Additional context or comments

3. Can you help us with this feature?

HiddenPeak commented Dec 19, 2024

crazywoola commented Dec 20, 2024

HiddenPeak commented Dec 20, 2024

dosubot bot commented Jan 22, 2025

hundredwz commented Feb 17, 2025 • edited Loading

hundredwz commented Feb 17, 2025 •

edited

Loading