Is there any example code base using this binding #297

cmingxu · 2023-11-22T10:33:19Z

I am wondering whether any project already uses this binding.

ChinaGISboyYang · 2023-11-22T13:38:11Z

Me too.

synw · 2023-11-25T16:49:17Z

My inference server project uses it: https://github.com/synw/goinfer

RobinHeitz · 2024-04-05T09:36:10Z

@synw Sorry for the off-topic question, but you might have some experience with this:

I'm testing Llama 2 models and finding them extremely slow, especially when the prompt contains a bit of context. At the beginning the CPU was at full load; now it's at around 10-15%, and a prediction takes about 30 minutes.
I assume it looks different for you, right?

synw · 2024-04-05T18:53:47Z

Get a GPU to speed up the prompt processing.
Use something recent and maintained: the llama.cpp version referenced in this library is too old.
