Seamless M4T audio input sample rate #389

tonyv · 2024-03-19T17:20:12Z

tonyv
Mar 19, 2024

I'm using the code provided by HuggingFace for Seamless M4T v1 to do translation for some audio files I have extracted from mp4 video recordings using ffmpeg (cmd used below for reference).

ffmpeg video_recording.mp4 -vn -acodec pcm_s16le -t 30 video_recording_0%d.wav

My understanding is that Seamless M4T v1 was trained on 16K audio . I had a couple of questions.

If the audio files I am providing have an original sample rate of 48K and the code resamples it to 16K, would that throw off the translations?
If Seamless was trained on 16K audio, can I pass it 48K audio or would that provide suboptimal translations?
Can seamless output the intermediate transcription of the audio before it performs the translation?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Seamless M4T audio input sample rate #389

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 0 comments

Select a reply

Seamless M4T audio input sample rate #389

tonyv Mar 19, 2024

Replies: 0 comments

tonyv
Mar 19, 2024