bug: Runs Exclusively on CPU #173

pkreissel · 2023-10-19T14:23:42Z

Describe the bug

This binding is about 10 times slower than native Whisper CPP because it is running exclusively on CPU on my M2 Device.
Whisper CPP runs fine on its own on the GPU, so there is no reason why this should not be possible for Python bindings.

To reproduce

I ran this code:

from whispercpp import Whisper

w = Whisper.from_pretrained("large")
transcript = w.transcribe_from_file("output.wav")

I compared with whisper cpp command:
./main -f output.wav -m models/ggml-large.bin -otxt

Expected behavior

Run on GPU and 10x faster

Environment

python 3.11
MacOS Sonoma
M2

The text was updated successfully, but these errors were encountered:

Jajcus · 2023-12-10T13:34:43Z

Strength of whisper.cpp comes with all the back-ends it can use (especially for non-nVidia GPU users – OpenVINO, OpenCL), unfortunately none of those seems to be supported in these bindings.

pkreissel added the bug Something isn't working label Oct 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bug: Runs Exclusively on CPU #173

bug: Runs Exclusively on CPU #173

pkreissel commented Oct 19, 2023

Jajcus commented Dec 10, 2023

bug: Runs Exclusively on CPU #173

bug: Runs Exclusively on CPU #173

Comments

pkreissel commented Oct 19, 2023

Describe the bug

To reproduce

Expected behavior

Environment

Jajcus commented Dec 10, 2023