Hello,

I noticed that the recent architecture improvements added modules for RoPE positional encoding and for injecting prompts via cross-attention. However, it seems that the two newly released Parler-TTS checkpoints do not use these features (if I understood correctly). Do you have any ablation results on the impact of RoPE positional encoding and of adding prompts in cross-attention? I'm interested in understanding how each of these modules affects the final model's performance.
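For context on the first module: my understanding is that RoPE rotates each query/key dimension pair by a position-dependent angle before the attention dot product, so relative position information falls out of the rotation algebra. Below is a minimal, generic sketch of that standard formulation, not Parler-TTS's actual implementation (the shapes and the `base` value are assumptions):

```python
import torch

def rotate_half(x: torch.Tensor) -> torch.Tensor:
    # Split the last dimension in two and rotate the pairs: (x1, x2) -> (-x2, x1).
    x1, x2 = x.chunk(2, dim=-1)
    return torch.cat((-x2, x1), dim=-1)

def apply_rope(q: torch.Tensor, k: torch.Tensor, base: float = 10000.0):
    # q, k: (batch, heads, seq_len, head_dim); head_dim must be even.
    seq_len, head_dim = q.shape[-2], q.shape[-1]
    # One frequency per dimension pair, decaying geometrically with dimension index.
    inv_freq = 1.0 / (base ** (torch.arange(0, head_dim, 2, dtype=torch.float32) / head_dim))
    angles = torch.outer(torch.arange(seq_len, dtype=torch.float32), inv_freq)
    angles = torch.cat((angles, angles), dim=-1)  # (seq_len, head_dim)
    cos, sin = angles.cos(), angles.sin()
    # Rotate queries and keys by the position-dependent angles.
    return q * cos + rotate_half(q) * sin, k * cos + rotate_half(k) * sin
```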
Additionally, is there a plan to update the training guide for the latest checkpoints? I'm particularly keen to learn how to fine-tune them.
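In the meantime, I'm assuming the loading/inference entry point is unchanged; here's the minimal sketch I'm working from (the checkpoint name is my guess at one of the new releases, substitute as appropriate):

```python
import torch
from parler_tts import ParlerTTSForConditionalGeneration
from transformers import AutoTokenizer

# Checkpoint name assumed; replace with whichever new checkpoint you fine-tune.
checkpoint = "parler-tts/parler-tts-mini-v1"

model = ParlerTTSForConditionalGeneration.from_pretrained(checkpoint)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)

# The description conditions the voice; the prompt is the text to be spoken.
description = "A female speaker with a calm, clear voice."
prompt = "Hello, this is a quick test."

input_ids = tokenizer(description, return_tensors="pt").input_ids
prompt_input_ids = tokenizer(prompt, return_tensors="pt").input_ids

with torch.no_grad():
    audio = model.generate(input_ids=input_ids, prompt_input_ids=prompt_input_ids)
print(audio.shape, model.config.sampling_rate)
```

If the training script's expected arguments or data preparation changed for the new checkpoints, that's the part I'd most like to see documented.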
Thank you for your amazing work!