Incorporate whisper.cpp for transcription #98

vivekuppal · 2023-11-26T19:31:35Z

Features

Incorporate whisper.cpp as one of the potential offline STT services
Incorporate the OAI latest large-v3 model
Download models automatically as and when required

Primary benefit of incorporating this is that whisper.cpp STT is consistently 35-40% faster in transcription when using GPU as compared to OAI whisper module.

Usage:

python main.py -stt whisper.cpp

Fixes

Move all log files to a separate logs folder
Move all utilities to utilities folder
Doc updates

Potential future improvements
Once below PR is merged it might remove the need for using ffmpeg to convert to 16 khz wav data file.
ggerganov/whisper.cpp#1549

Resolves #95

…ory.

Incorporate whisper.cpp for transcription.

d96e775

vivekuppal added the enhancement New feature or request label Nov 26, 2023

vivekuppal self-assigned this Nov 26, 2023

do not truncate logs.

be6232b

vivekuppal changed the title ~~[Draft] Incorporate whisper.cpp for transcription~~ Incorporate whisper.cpp for transcription Nov 26, 2023

vivekuppal added 2 commits November 27, 2023 11:02

Update whisper.cpp to version 1.5.1. Move all models to models direct…

7f66a2b

…ory.

Move utils to their folder.

a0c8fe3

vivekuppal requested a review from abhinavuppal1 November 27, 2023 16:47

abhinavuppal1 approved these changes Nov 27, 2023

View reviewed changes

Download models dynamically at run time.

874cb99

vivekuppal merged commit 1269110 into main Nov 27, 2023
2 checks passed

vivekuppal deleted the vu-whisper.cpp branch November 27, 2023 20:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Incorporate whisper.cpp for transcription #98

Incorporate whisper.cpp for transcription #98

vivekuppal commented Nov 26, 2023 •

edited

Loading

Incorporate whisper.cpp for transcription #98

Incorporate whisper.cpp for transcription #98

Conversation

vivekuppal commented Nov 26, 2023 • edited Loading

Features

Fixes

vivekuppal commented Nov 26, 2023 •

edited

Loading