Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Streaming setup in the whisper tokenizer #147

Open
zsLin177 opened this issue Jan 8, 2025 · 0 comments
Open

Streaming setup in the whisper tokenizer #147

zsLin177 opened this issue Jan 8, 2025 · 0 comments

Comments

@zsLin177
Copy link

zsLin177 commented Jan 8, 2025

Hello,

Thanks for the nice work! I noticed in your paper that the Whisper encoder supports "Causality for Streaming Inference." However, in the currently released model, I observed that both encoder_causal_attention and quantize_causal_encoder are set to false in the tokenizer's configuration. Whether the current version of the tokenizer supports streaming input for speech encoding?

Additionally, could you explain the impact of setting quantize_causal_block_size to 200?

Thank you for your time and assistance!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant