How to change the language recognition of Deepgram API? I want him to recognize it as Chinese instead of English. I tried to modify the language in DeepgramSTTModel in the transfer_models.py file, but still can only recognize English #189

willt0 · 2024-03-29T03:23:02Z

 def __init__(self, stt_model_config: dict):
        # Check for api_key
        if stt_model_config["api_key"] is None:
            raise Exception("Attempt to create Deepgram STT Model without an api key.")  # pylint: disable=W0719
        # self.lang = 'en-US'
        self.lang = 'zh-CN'

        print('[INFO] Using Deepgram API for transcription.')
        self.audio_model = DeepgramClient(stt_model_config["api_key"])

The text was updated successfully, but these errors were encountered:

abhinavuppal1 · 2024-03-29T15:09:27Z

The configuration is not clear from the issue description. Are you using command line parameters or override.yaml to use deepgram.

The observation is correct that deepgram is unable to recognize any other languages besides english.

I believe the following change will resolve the issue

Add the line
detect_language=True

here

transcribe/sdk/transcriber_models.py

Line 311 in f25f087

paragraphs=True)

The method will look like this with the additional option of detecting the language.

    def get_transcription(self, wav_file_path: str):
        """Get text using STT
        """
        try:
            with open(wav_file_path, "rb") as audio_file:
                buffer_data = audio_file.read()

            payload: FileSource = {
                "buffer": buffer_data
                }

            options = PrerecordedOptions(
                model="nova",
                smart_format=True,
                utterances=True,
                punctuate=True,
                paragraphs=True,
                detect_language=True)

            response = self.audio_model.listen.prerecorded.v("1").transcribe_file(payload, options)
            # This is not necessary and just a debugging aid
            with open('logs/deep.json', mode='a', encoding='utf-8') as deep_log:
                deep_log.write(response.to_json(indent=4))

            return response
        except Exception as exception:
            print(exception)

        return None

This should resolve the issue.

willt0 · 2024-03-29T15:52:20Z

Thank you！！！The problem has been resolved.

abhinavuppal1 · 2024-03-29T16:50:06Z

Resolved in #190.

willt0 closed this as completed Mar 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to change the language recognition of Deepgram API? I want him to recognize it as Chinese instead of English. I tried to modify the language in DeepgramSTTModel in the transfer_models.py file, but still can only recognize English #189

How to change the language recognition of Deepgram API? I want him to recognize it as Chinese instead of English. I tried to modify the language in DeepgramSTTModel in the transfer_models.py file, but still can only recognize English #189

willt0 commented Mar 29, 2024

abhinavuppal1 commented Mar 29, 2024

willt0 commented Mar 29, 2024

abhinavuppal1 commented Mar 29, 2024

How to change the language recognition of Deepgram API? I want him to recognize it as Chinese instead of English. I tried to modify the language in DeepgramSTTModel in the transfer_models.py file, but still can only recognize English #189

How to change the language recognition of Deepgram API? I want him to recognize it as Chinese instead of English. I tried to modify the language in DeepgramSTTModel in the transfer_models.py file, but still can only recognize English #189

Comments

willt0 commented Mar 29, 2024

abhinavuppal1 commented Mar 29, 2024

willt0 commented Mar 29, 2024

abhinavuppal1 commented Mar 29, 2024