Need the abillity to save/re-use a generated voice #14

rmangino · 2024-04-11T13:12:53Z

We use TTS in an eLearning environment where we generate hundreds of videos per year. All of these videos must use the same exact voice for consistency.

To use Parler-TTS I'd need to be able to generate a voice (based upon a description), save it, then use it across multiple TTS sessions. We currently use Google's TTS api which allows me to select from a list of voices so that all of my TTS audio sounds exactly like the same speaker.

janewu77 · 2024-04-15T06:21:09Z

I'm also curious about how to maintain the consistency of the generated voice.

shuaijiang · 2024-04-16T02:29:46Z

Parler-TTS generate a similar but different voice with same discription but different Transcript text

juangea · 2024-04-17T13:53:54Z

For this to be useful we need to be able to select the voice, for example if I have a long video that I want to dub with thism, without being able to generate the exact same voice for all the text this is useless I'm afraid.

sanchit-gandhi · 2024-04-17T16:27:13Z

Thanks for the feedback all! Cross-posting a response from @ylacombe: https://huggingface.co/parler-tts/parler_tts_mini_v0.1/discussions/7#661fda86994005b654b417a4

In short, you can fine-tune Parler-TTS on a single speaker with as little as 30h of data. In doing so, you can fix the voice to the single speaker, while still maintaining the text description control.

As mentioned, we'll explore more voice control (e.g. through voice prompting) for the v1 release.

Jefferderp · 2024-04-19T19:36:10Z

Forgive the amateur question, but is Parler-TTS deterministic at all? Does each iteration have a seed associated? If so, could we potentially invoke with that same seed to gain more consistency between runs?

Guppy16 · 2024-08-17T14:27:43Z

#110

Does this help? I have a notebook demonstrating how to try and maintain voice consistency.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Need the abillity to save/re-use a generated voice #14

Need the abillity to save/re-use a generated voice #14

rmangino commented Apr 11, 2024 •

edited

Loading

janewu77 commented Apr 15, 2024

shuaijiang commented Apr 16, 2024

juangea commented Apr 17, 2024

sanchit-gandhi commented Apr 17, 2024

Jefferderp commented Apr 19, 2024

Guppy16 commented Aug 17, 2024

Need the abillity to save/re-use a generated voice #14

Need the abillity to save/re-use a generated voice #14

Comments

rmangino commented Apr 11, 2024 • edited Loading

janewu77 commented Apr 15, 2024

shuaijiang commented Apr 16, 2024

juangea commented Apr 17, 2024

sanchit-gandhi commented Apr 17, 2024

Jefferderp commented Apr 19, 2024

Guppy16 commented Aug 17, 2024

rmangino commented Apr 11, 2024 •

edited

Loading