It is possible, but not on every segment or every VAD segment. With the multilingual=True option the language is autodetected on every chunk, but there is no guarantee that a chunk contains only one language, so this option is based on wishful thinking.
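To illustrate why per-chunk detection can miss a language switch, here is a minimal sketch. It assumes fixed 30-second chunks on a regular grid, which is a simplification of how decoding windows actually advance, and the `SpeechSegment` type, the language labels, and `languages_per_chunk` are hypothetical names used only for this example.

```python
from dataclasses import dataclass


@dataclass
class SpeechSegment:
    start: float   # seconds
    end: float     # seconds
    language: str  # ground-truth label, known here only for illustration


def languages_per_chunk(segments, chunk_seconds=30.0):
    """Map each fixed-length chunk index to the set of languages spoken in it."""
    chunks = {}
    for seg in segments:
        first = int(seg.start // chunk_seconds)
        # Small epsilon so a segment ending exactly on a chunk boundary
        # is not counted as part of the following chunk.
        last = int(max(seg.start, seg.end - 1e-9) // chunk_seconds)
        for idx in range(first, last + 1):
            chunks.setdefault(idx, set()).add(seg.language)
    return chunks


# Hypothetical VAD output: Chinese speech followed by English speech.
segments = [
    SpeechSegment(0.0, 12.0, "zh"),
    SpeechSegment(14.0, 27.0, "en"),
    SpeechSegment(31.0, 55.0, "en"),
]

# Chunks that contain more than one language; a single detected language
# cannot be correct for all of the speech inside such a chunk.
mixed = {i: langs for i, langs in languages_per_chunk(segments).items() if len(langs) > 1}
print(mixed)
```

In this made-up example the first chunk holds both Chinese and English speech, so whichever language is detected for that chunk is wrong for part of it.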
If you want to try autodetection on every VAD segment, I implemented that idea in Faster-Whisper-XXL. To run batched on unmerged VAD segments, activate it with these options: --batched --unmerged --multilingual true. You may also want to add --vad_method pyannote_onnx_v3 --verbose true.
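When VAD is enabled, the audio is split into separate segments, but language detection is still based on the earlier segments. If the language changes often from one part of the audio to the next, the output ends up in only one language.
Would it be possible to re-detect the language on each segment, reinitialize the tokens, and only then produce the output?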
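The idea in the question can be sketched as a loop that detects the language for each VAD segment and passes it to the decoder for that segment only. This is a rough sketch, not the implementation of any library: `transcribe_per_segment`, `detect_language`, and `transcribe` are hypothetical names, and the two callables are stand-ins for whatever model calls are actually used.

```python
from typing import Callable, List, Sequence, Tuple

Segment = Tuple[float, float]  # (start, end) in seconds


def transcribe_per_segment(
    audio: Sequence[float],
    sample_rate: int,
    segments: List[Segment],
    detect_language: Callable[[Sequence[float]], str],
    transcribe: Callable[[Sequence[float], str], str],
) -> List[Tuple[float, float, str, str]]:
    """Detect the language of every segment separately, then decode it."""
    results = []
    for start, end in segments:
        clip = audio[int(start * sample_rate):int(end * sample_rate)]
        # Language is re-detected here, so nothing carries over
        # from the previous segment.
        language = detect_language(clip)
        # The decoder is started fresh with this segment's language.
        text = transcribe(clip, language)
        results.append((start, end, language, text))
    return results


# Stub callables so the sketch runs without a model (hypothetical behavior).
def fake_detect(clip):
    return "zh" if sum(clip) > 0 else "en"


def fake_transcribe(clip, language):
    return f"<{language}:{len(clip)} samples>"


sample_rate = 4
audio = [1.0] * 8 + [-1.0] * 8
results = transcribe_per_segment(
    audio, sample_rate, [(0.0, 2.0), (2.0, 4.0)], fake_detect, fake_transcribe
)
for row in results:
    print(row)
```

Short segments carry little evidence for language detection, so a real implementation would likely need a fallback, such as reusing the previous segment's language when the detector is not confident.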