Request for image input support #68

reachsak · 2024-06-03T16:04:48Z

I plan to implement the function calling with vision models such as LLaVA and Nous-Hermes-2-Vision-Alpha based on the image, but it seems that the current implementation in the example folder only supports text input. It'd be great to have the image input support in the future version. Or please let me know if know a workaround to add image input support for this.
Thank you,

Maximilian-Winter · 2024-06-10T20:49:38Z

@reachsak I will work on that. The problem I have at the moment, is that llama.cpp server stopped supporting images. But I will add it for vllm and TGI.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Request for image input support #68

Request for image input support #68

reachsak commented Jun 3, 2024

Maximilian-Winter commented Jun 10, 2024

Request for image input support #68

Request for image input support #68

Comments

reachsak commented Jun 3, 2024

Maximilian-Winter commented Jun 10, 2024