-
-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feature: add INSTRUCTOR models as embedding backend #709
Comments
|
this is now possible as internal backends where moved to |
With #778 it was introduced the huggingface embedding in python - will follow along with docs soon but an example can be seen in https://github.com/go-skynet/LocalAI/blob/master/tests/models_fixtures/grpc.yaml , didn't tried yet with instructor embeddings but worth a shot? |
Hi, Thanks for your feedback. Not tested it yet. We can close an issue now. |
Is your feature request related to a problem? Please describe.
Describe the solution you'd like
These
instructor
models are well tuned for embeddings, as demonstrated here.But as I understand it, they cannot be used as an embedding backend in
localai
, because it is not bert or llama.cpp models.And as I see here, there is no way to use different embedding models than
llama.cpp
compatible models andbert
models.There is a way to use these embedding models via chroma, but it may be more reasonable to use embeddings via
localai
REST API.Why
localai
, but notchroma
?Because the simpler and more correct way would be to use
localai
as the backend for embeddings, because then end-user can scale this solution, e.g. in kubernetes and make changes to the architecture of that backend fairly quickly.How this would work through chroma is hard for me to say.
I think it's not quite right to use chroma as embeddings backend and also complicates the code a bit if someone decides to switch these models to openai bert for example via environment variable or something like that.
Example of model configuration:
Ref: https://github.com/go-skynet/model-gallery
Describe alternatives you've considered
Example of code with
instructor
models for embedding inchroma
vector database:Additional context
Links:
The text was updated successfully, but these errors were encountered: