- 🌟 Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis, arXiv 2411.01156. Shijia Liao, Yuxuan Wang, Tianyu Li, ..., Rongzhi Zhou, Yijin Xing · (fish-speech - fishaudio) · (𝕏)
- 🌟 Continuous Speech Synthesis using per-token Latent Diffusion, arXiv 2410.16048. Arnon Turetzky, Nimrod Shabtay, Slava Shechtman, ..., Ron Hoory, Avihu Dekel · (s3.us-south.objectstorage.softlayer)
- MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer
- Parakeet: A natural-sounding, conversational text-to-speech model
- F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching · (F5-TTS - lpscr)