Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request for image input support #68

Open
reachsak opened this issue Jun 3, 2024 · 1 comment
Open

Request for image input support #68

reachsak opened this issue Jun 3, 2024 · 1 comment

Comments

@reachsak
Copy link

reachsak commented Jun 3, 2024

I plan to implement the function calling with vision models such as LLaVA and Nous-Hermes-2-Vision-Alpha based on the image, but it seems that the current implementation in the example folder only supports text input. It'd be great to have the image input support in the future version. Or please let me know if know a workaround to add image input support for this.
Thank you,

@Maximilian-Winter
Copy link
Owner

@reachsak I will work on that. The problem I have at the moment, is that llama.cpp server stopped supporting images. But I will add it for vllm and TGI.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants