Replies: 2 comments
-
Could you describe how users would use it if they don't know what ssml tags are? Currently, users only need to enter normal text to get audio samples. |
Beta Was this translation helpful? Give feedback.
0 replies
-
You can write in project description about it. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Are you going to support ssml tags? Example:
Supported SSML Tags - Amazon Polly
https://docs.aws.amazon.com/polly/latest/dg/supportedtags.html#break-tag
At least pauses between sentences would be a huge difference.
On top of that, a lot of pre-trained models speak very fast and because of this don't sound natural. There are sometimes even no distinct breaks between words (for example vctk models available for Piper). I haven't checked if you use this models. They sound quite natural but are so fast that are almost unusable (at least in some cases).
Beta Was this translation helpful? Give feedback.
All reactions