Enable new models in audio-to-text #163

Merged: 17 commits, Nov 5, 2024

Conversation

eliteprox
Collaborator

@eliteprox eliteprox commented Aug 17, 2024

This change adds support for two new Whisper models: distil-whisper/distil-large-v3 and openai/whisper-medium.

It also optimizes those models to use the appropriate precision (BFLOAT16, FLOAT16 or FLOAT32).

Credit to @ad-astra-video for initially exploring these models and optimizations.
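For readers unfamiliar with precision selection, here is a minimal sketch of how a pipeline might choose between the three precisions named above. The function name `pick_precision` and the selection rule (full precision on CPU, BFLOAT16 on GPUs that support it, FLOAT16 otherwise) are assumptions made for illustration; they are not taken from this PR's code.

```python
def pick_precision(has_cuda: bool, supports_bf16: bool) -> str:
    """Return the name of a floating-point precision for inference.

    Hypothetical rule (an assumption, not this PR's logic):
    - no GPU: float32, since half precision is often slow or unsupported on CPU
    - GPU with bfloat16 support: bfloat16
    - any other GPU: float16
    """
    if not has_cuda:
        return "float32"
    if supports_bf16:
        return "bfloat16"
    return "float16"


# Possible wiring with PyTorch (left as comments so the sketch has no dependencies):
#   import torch
#   has_cuda = torch.cuda.is_available()
#   name = pick_precision(has_cuda, has_cuda and torch.cuda.is_bf16_supported())
#   torch_dtype = getattr(torch, name)  # e.g. torch.bfloat16

print(pick_precision(has_cuda=False, supports_bf16=False))
```

Returning a plain string keeps the decision testable without a GPU; the caller can map it to a framework dtype afterwards.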

runner/app/pipelines/audio_to_text.py (review thread: outdated, resolved)
@eliteprox
Collaborator Author

@rickstaa I made several changes since you last reviewed this PR, so I held off on merging. Could you or @ad-astra-video re-review the latest changes?

This commit fixes some rebase conflicts that were introduced in the last
rebase.
This commit updates the A2T pipeline log so that it is clear when the default
batch_size and chunk_length_s are used.
This commit removes the debug patch that was accidentally added.
Collaborator

@rickstaa rickstaa left a comment

@eliteprox great work! Thanks!

@eliteprox eliteprox merged commit acf9b15 into livepeer:main Nov 5, 2024
3 checks passed
3 participants