-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
dataset #2
Comments
Hi! |
Thanks for sharing your experiences my friend. I have one more question; Why there is not any spaces between the words? for example what happen if we build dataset like "gole ziba" instead of "goleziba"? |
In the speech synthesis from the phoneme sequence, space is not important. Because you have to separate the phonemes, consider the ID number for each phoneme, and use the sequence of IDs, so the spaces between the words are removed in this process. Of course, you can consider a new token (like other phonemes) for the space between words but note that usually the duration of this phoneme will be zero or very little. |
Hi!
I have a question about dataset. Suppose that I have several wavs and the corresponding text files of them (written in Persian language). How I can create phoneme_transcriptions of them?
The text was updated successfully, but these errors were encountered: