[Feature Request] Constrain Available Languages when Autodetecting Language #1164

WesleyFister · 2024-11-22T06:57:23Z

Currently, Faster-Whisper only allows you to specify a single language or to detect the language out of a pool of 94 languages. I would like to be able to limit which languages can be detected. Something like the following would limit autodetection to English, Spanish and French:
model.transcribe("audio.mp3", beam_size=5, language=["en", "es", "fr"])

MahmoudAshraf97 · 2024-11-23T12:28:31Z

You can already do this: the detect_language function returns the probability of all languages. You can then exclude all languages except these 3, choose the one with the highest probability, and pass it manually to transcribe.

WesleyFister · 2024-11-23T18:33:35Z

I see. I don't think the version of Faster-Whisper I was using (1.0.3) allowed you to return language probabilities like this. I wrote some code to return the desired languages. It works fine, but I still think it would be simpler for the user if you could just pass a language list to the transcribe function. I'll let you decide whether to close this issue or not.

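```python
from faster_whisper import WhisperModel, decode_audio

def limit_languages(audio, allowed_languages):
    # Decode to 16 kHz float32 samples, the input detect_language expects
    # (scipy's wavfile.read returns raw samples at the file's own rate).
    audio_data = decode_audio(audio, sampling_rate=16000)

    model = WhisperModel("large-v2", device="cpu", compute_type="int8")
    language, language_probability, all_language_probs = model.detect_language(audio_data)

    # Keep the allowed language with the highest detection probability.
    detected_language = None
    score = 0
    for language_code, language_prob in all_language_probs:
        if language_code in allowed_languages and language_prob > score:
            score = language_prob
            detected_language = language_code

    # None means no allowed language was reported with a nonzero probability.
    return detected_language
```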
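
The selection step in the function above does not need the model at all, so it can be separated out and checked on its own. The following is only a minimal sketch: `pick_allowed_language` is a hypothetical helper name, not part of Faster-Whisper, and the probabilities are made up for illustration. It assumes `all_language_probs` is a list of `(language_code, probability)` pairs, as in the loop above.

```python
from typing import Iterable, Optional, Tuple


def pick_allowed_language(
    all_language_probs: Iterable[Tuple[str, float]],
    allowed_languages: Iterable[str],
) -> Optional[str]:
    """Return the allowed language code with the highest probability.

    Hypothetical helper, not a Faster-Whisper API. Returns None when
    none of the allowed languages appears in the input.
    """
    allowed = set(allowed_languages)
    candidates = [(code, prob) for code, prob in all_language_probs if code in allowed]
    if not candidates:
        return None
    # max() over the probability field picks the most likely allowed language.
    return max(candidates, key=lambda item: item[1])[0]


# Made-up probabilities: German scores highest overall but is not allowed.
example_probs = [("de", 0.55), ("en", 0.30), ("es", 0.10), ("fr", 0.05)]
print(pick_allowed_language(example_probs, ["en", "es", "fr"]))  # en
```

The returned code can then be passed as `language=` to `model.transcribe`. If the helper returns `None`, one option is to omit `language` and fall back to unrestricted detection.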

George0828Zhang · 2024-12-18T06:26:07Z

> You can already do this: the detect_language function returns the probability of all languages. You can then exclude all languages except these 3, choose the one with the highest probability, and pass it manually to transcribe.

Hi @MahmoudAshraf97, what if multilingual=True? It does not seem possible to limit the candidate languages in a code-switched setting?

MahmoudAshraf97 · 2024-12-18T07:57:50Z

Yes, it's not possible
