Speech-to-Text Model Comparison #34

Open
Cgarg9 opened this issue Mar 14, 2025 · 4 comments
Labels
medium (medium level difficulty), pwoc

Comments

@Cgarg9
Collaborator

Cgarg9 commented Mar 14, 2025

Description:

To help users understand different speech recognition methods, add a notebook that runs multiple models on the same dataset and compares their results.

Tasks:

  • Compare CMU Sphinx, DeepSpeech, Wav2Vec 2.0, and OpenAI Whisper.
  • Provide Word Error Rate (WER) and Sentence Error Rate (SER) comparisons.
  • Summarize key use cases and limitations for each model.
  • Name the notebook speech_to_text_comparison.ipynb.
  • Update the README file with relevant references.
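
As a starting point for the WER and SER task above, here is a minimal sketch of both metrics in plain Python. It assumes WER is the word-level edit distance (substitutions, insertions, deletions) divided by the total number of reference words, and SER is the fraction of utterances whose transcript differs from the reference at all. The function names, the lowercase/whitespace normalization, and the sample transcripts are illustrative assumptions, not part of the project.

```python
from typing import List, Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def normalize(text: str) -> List[str]:
    # Assumed normalization: lowercase and split on whitespace.
    # A real comparison may also need punctuation stripping.
    return text.lower().split()


def wer(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """Corpus-level WER: total word errors / total reference words."""
    errors = 0
    words = 0
    for ref, hyp in zip(references, hypotheses):
        r, h = normalize(ref), normalize(hyp)
        errors += edit_distance(r, h)
        words += len(r)
    if words == 0:
        raise ValueError("references contain no words")
    return errors / words


def ser(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """SER: fraction of utterances with at least one word error."""
    if not references:
        raise ValueError("references is empty")
    wrong = sum(
        normalize(r) != normalize(h)
        for r, h in zip(references, hypotheses)
    )
    return wrong / len(references)


# Made-up transcripts, only to show the call pattern.
refs = ["the cat sat on the mat", "hello world"]
hyps = ["the cat sat on mat", "hello world"]
print(wer(refs, hyps))  # 0.125 (1 deletion over 8 reference words)
print(ser(refs, hyps))  # 0.5   (1 of 2 utterances has an error)
```

In the notebook, each model's transcripts would be passed as `hypotheses` against the same `references`, so the four models are scored on identical text with identical normalization.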
Cgarg9 added the medium (medium level difficulty) and pwoc labels on Mar 14, 2025
@Kanavpreet-Singh
Contributor

Can I work on this issue, @Cgarg9?

@Cgarg9
Collaborator Author

Cgarg9 commented Mar 15, 2025

sure.

@Cgarg9
Collaborator Author

Cgarg9 commented Mar 17, 2025

@Kanavpreet-Singh, any updates?

@Cgarg9
Collaborator Author

Cgarg9 commented Mar 19, 2025

@Kanavpreet-Singh, I will close this by tomorrow if there is no response.


2 participants