[Feature]: Configuration of CPU+GPU offload #355

Aisuko · 2024-08-09T00:08:47Z

Contact Details(optional)

No response

What feature are you requesting?

Currently, we didn't compile llamacpp with cuda accelerate. If we want to support use offload feature, we need to compile llamacpp with gpu label.

https://github.com/SkywardAI/llama.cpp/blob/a59f8fdc85e1119d470d8766e29617962549d993/examples/main/README.md?plain=1#L72

how many layer you want to run your model on GPU?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: Configuration of CPU+GPU offload #355

[Feature]: Configuration of CPU+GPU offload #355

Aisuko commented Aug 9, 2024 •

edited

Loading

[Feature]: Configuration of CPU+GPU offload #355

[Feature]: Configuration of CPU+GPU offload #355

Comments

Aisuko commented Aug 9, 2024 • edited Loading

Contact Details(optional)

What feature are you requesting?

Aisuko commented Aug 9, 2024 •

edited

Loading