Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incorporate whisper.cpp for transcription #98

Merged
merged 5 commits into from
Nov 27, 2023
Merged

Conversation

vivekuppal
Copy link
Owner

@vivekuppal vivekuppal commented Nov 26, 2023

Features

  • Incorporate whisper.cpp as one of the potential offline STT services
  • Incorporate the OAI latest large-v3 model
  • Download models automatically as and when required

Primary benefit of incorporating this is that whisper.cpp STT is consistently 35-40% faster in transcription when using GPU as compared to OAI whisper module.

Usage:

python main.py -stt whisper.cpp

Fixes

  • Move all log files to a separate logs folder
  • Move all utilities to utilities folder
  • Doc updates

Potential future improvements
Once below PR is merged it might remove the need for using ffmpeg to convert to 16 khz wav data file.
ggerganov/whisper.cpp#1549

Resolves #95

@vivekuppal vivekuppal added the enhancement New feature or request label Nov 26, 2023
@vivekuppal vivekuppal self-assigned this Nov 26, 2023
@vivekuppal vivekuppal changed the title [Draft] Incorporate whisper.cpp for transcription Incorporate whisper.cpp for transcription Nov 26, 2023
@vivekuppal vivekuppal merged commit 1269110 into main Nov 27, 2023
2 checks passed
@vivekuppal vivekuppal deleted the vu-whisper.cpp branch November 27, 2023 20:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Incorporate the faster whisper.cpp into Transcribe
2 participants