Speech-to-Text Model Comparison #34


Open

Cgarg9 opened this issue Mar 14, 2025 · 4 comments

Collaborator

Cgarg9 commented Mar 14, 2025

Description:

To help users understand different speech recognition methods, add a notebook that applies multiple models on the same dataset and compares results.

Tasks:

Compare CMU Sphinx, DeepSpeech, Wav2Vec 2.0, OpenAI Whisper.
Provide Word Error Rate (WER) and Sentence Error Rate (SER) comparisons.
Summarize key use cases and limitations for each model.
Name the notebook speech_to_text_comparison.ipynb.
Update the README file with relevant references.
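The issue does not say how WER and SER should be computed, so the following is only a minimal sketch of one common definition: WER as the word-level edit distance divided by the number of reference words, and SER as the fraction of sentences with at least one error. The function names (`word_edit_distance`, `wer`, `ser`) and the lowercase-and-split normalization are assumptions for illustration, not part of the project.

```python
def word_edit_distance(ref_words, hyp_words):
    """Levenshtein distance between two word lists
    (counts substitutions, insertions, and deletions)."""
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def normalize(text):
    # Assumed normalization: lowercase and split on whitespace only.
    return text.lower().split()


def wer(references, hypotheses):
    """Corpus-level WER: total word errors / total reference words."""
    errors = 0
    words = 0
    for ref, hyp in zip(references, hypotheses):
        ref_words = normalize(ref)
        errors += word_edit_distance(ref_words, normalize(hyp))
        words += len(ref_words)
    return errors / words


def ser(references, hypotheses):
    """SER: fraction of sentences that differ from their reference."""
    wrong = sum(
        1 for ref, hyp in zip(references, hypotheses)
        if normalize(ref) != normalize(hyp)
    )
    return wrong / len(references)


# Hypothetical transcripts, standing in for one model's output.
references = ["the cat sat on the mat", "hello world"]
hypotheses = ["the cat sat on mat", "hello world"]
print(f"WER: {wer(references, hypotheses):.3f}")
print(f"SER: {ser(references, hypotheses):.3f}")
```

Running the same two functions over each model's transcripts of a shared dataset would give directly comparable numbers. Stricter normalization, such as stripping punctuation, would change the scores and should be applied identically to every model.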

Cgarg9 added medium pwoc labels

Contributor

Kanavpreet-Singh commented Mar 15, 2025

Can I work on this issue, @Cgarg9?

Collaborator Author

Cgarg9 commented Mar 15, 2025

sure.

Cgarg9 assigned Kanavpreet-Singh

Collaborator Author

Cgarg9 commented Mar 17, 2025

@Kanavpreet-Singh any updates?

Collaborator Author

Cgarg9 commented Mar 19, 2025

@Kanavpreet-Singh I will close this by tomorrow if there is no response.

Cgarg9 unassigned Kanavpreet-Singh
