Skip to content

whisper.cpp accuracy #1035

Answered by bobqianic
gauvainjl asked this question in Q&A
Jun 21, 2023 · 3 comments · 4 replies
Discussion options

You must be logged in to vote

I found older messages about the same topic. I understand that whisper.cpp does not implement the same decoding strategy as openai code, meaning we should not expect the same accuracy. A 30% relative difference in the WER is in fact a huge difference when comparing different decodings with the same model. Differences in decoding for speech recognizers usually does not impact the WER by more than a few percent relative. Are there any plans to reduce this large gap?

We've wrapped up our analysis comparing the log_mel_spectrogram generation between whisper.cpp and OpenAI's Whisper.

To summarize the main issues we found in whisper.cpp:

  1. The Stage-1 padding (zero padding) is inadequate. Whi…

Replies: 3 comments 4 replies

Comment options

You must be logged in to vote
1 reply
@gauvainjl
Comment options

Comment options

You must be logged in to vote
3 replies
@bobqianic
Comment options

@gauvainjl
Comment options

@bobqianic
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by bobqianic
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants