You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If the length of content code, rhythm code and pitch code is different from each other, how do they align since there is no attention mechanism in decoder?
The text was updated successfully, but these errors were encountered:
The rhythm code provides the alignment information. The decoder just use this information automatically to align the content code and/or pitch code.
Is it right in your code(model.py: line308-309) when content code, rhythm code and pitch code are from different utterance, since the three have alignment problem. I don't find details of the variant to get alignment information propose in your papar, Appendix B.3, can you tell me the details. thank you.
If the length of content code, rhythm code and pitch code is different from each other, how do they align since there is no attention mechanism in decoder?
The text was updated successfully, but these errors were encountered: