In the verbalizer, if one label (e.g. Fruit) corresponds to multiple label words (e.g. apple, pear, watermelon), and the prompted sentence is:
This is a [MASK] comment: it tasted good.
then the loss function in your current code will boost the probability of predicting all three words {apple, pear, watermelon} at the [MASK] position. For a PLM, different contexts result in different probabilities for these three words. For example, "red" is more likely to appear around "apple" than around "pear". If you boost the probability of generating "pear" around "red", it might corrupt the knowledge in the PLM to some extent, or, on the contrary, make prompt-tuning harder.
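To make the concern concrete, here is a minimal sketch in plain PyTorch (not the repo's actual code; the tensor shapes and names such as `mask_logits` are assumptions) contrasting the per-word objective described above with a loss that marginalizes over a class's label words, so the model only needs some label word to fit the context rather than all of them:

```python
import torch
import torch.nn.functional as F

# Assumed shapes (hypothetical, for illustration only):
#   mask_logits:    (batch_size, vocab_size) logits at the [MASK] position
#   label_word_ids: label_word_ids[c] = vocab ids of the label words of class c
#   labels:         (batch_size,) gold class indices

def per_word_loss(mask_logits, label_word_ids, labels):
    """Behaviour described above: every label word of the gold class is a
    target, so "apple", "pear" and "watermelon" are all pushed up equally."""
    log_probs = F.log_softmax(mask_logits, dim=-1)                 # (B, V)
    loss = 0.0
    for i, c in enumerate(labels.tolist()):
        word_ids = torch.tensor(label_word_ids[c])
        loss = loss - log_probs[i, word_ids].mean()
    return loss / len(labels)

def marginalized_loss(mask_logits, label_word_ids, labels):
    """Alternative: sum the probabilities of a class's label words, so the
    loss is satisfied when any one of them fits the context."""
    log_probs = F.log_softmax(mask_logits, dim=-1)                 # (B, V)
    class_scores = torch.stack(
        [torch.logsumexp(log_probs[:, torch.tensor(ids)], dim=-1)  # log sum_w P(w)
         for ids in label_word_ids],
        dim=-1,                                                    # (B, num_classes)
    )
    # Renormalize over classes and take the usual negative log-likelihood.
    return F.nll_loss(F.log_softmax(class_scores, dim=-1), labels)
```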
Maybe this paper could help with designing the loss: https://aclanthology.org/2022.acl-long.158.pdf