I see that most of the x86 acceleration APIs are supported in llama.cpp, which is great! Intel AMX seems to have even more potential for speeding up inference.

AMX is supported by Intel's latest server and workstation CPUs. Right now, inference performance on my AMX-enabled Intel w7-2495 system (eval ~8 tokens per second) is lower than on my MacBook Pro M1 (eval ~11 tokens per second).
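Before digging into llama.cpp itself, it may be worth confirming that the CPU actually exposes the AMX feature bits. On Linux, a quick `grep amx /proc/cpuinfo` should list `amx_tile`, `amx_bf16`, and `amx_int8`. Below is a minimal C sketch (GCC/Clang on x86-64) that does the same check directly via CPUID leaf 7, sub-leaf 0; the bit positions follow the Intel SDM:

```c
// Minimal AMX feature check via CPUID leaf 7, sub-leaf 0 (GCC/Clang, x86-64).
#include <cpuid.h>
#include <stdio.h>

int main(void) {
    unsigned int eax, ebx, ecx, edx;
    if (!__get_cpuid_count(7, 0, &eax, &ebx, &ecx, &edx)) {
        printf("CPUID leaf 7 not available\n");
        return 1;
    }
    // Intel SDM: EDX bit 22 = AMX-BF16, bit 24 = AMX-TILE, bit 25 = AMX-INT8
    printf("AMX-BF16: %s\n", (edx >> 22) & 1 ? "yes" : "no");
    printf("AMX-TILE: %s\n", (edx >> 24) & 1 ? "yes" : "no");
    printf("AMX-INT8: %s\n", (edx >> 25) & 1 ? "yes" : "no");
    return 0;
}
```

Compile with `gcc -O2 amx_check.c -o amx_check` and run it on the target machine. If all three bits read "yes", the hardware supports AMX and any remaining performance gap would come from the software path rather than missing hardware support.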