VAD parameter skipping loud segments and hallucinating during silence #1312
andremarko
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello guys,
In my scenario, I'm running Faster Whisper on a Brazilian Portuguese sports program, with VAD enabled.
With the default VAD parameters, some loud speech segments (especially in moments when the narrator shouts “goal!”) are not being transcribed.
However, when I lower the threshold and increase the negative threshold:
those segments are transcribed, but I start getting hallucinated text during silence, often repeated words.
Has anyone found an effective balance for this kind of content?
Beta Was this translation helpful? Give feedback.
All reactions