Transcription cut randomly when using a translated model. #1363
Unanswered
bansal-sid
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I used a fine-tuned model for Telugu language. The model is: "vasista22/whisper-telugu-base"
Since the model had to be translated first, I did so as per the guidelines mentioned using ctranslate2.
Just for reference, I translated the model using the following code:
Now when I tried to transcribe using this translated model, for all the files, partial transcription was gneerated. I had some audio files of approx 30 seconds.
Transcription would be generated for like only first 10-15 seconds, and nothing for the rest of the audio. Whatever transcription was generated was very much accurate.
The main thing that concerns me is that the same fine-tuned model, when I use it to transcribe in its original form i.e. without translating, I'm able to generate full transcriptions.
Accurate transcription code:
This behaviour was very strange to me given that, for English language, I didn't face any issues with the translated model using same steps.
Beta Was this translation helpful? Give feedback.
All reactions