
Additive or multiplicative attention? #16

Open
Ataxias opened this issue Jun 30, 2017 · 0 comments
Ataxias commented Jun 30, 2017

In the "Attentional Interfaces" section, there is a reference to "Bahdanau, et al. 2014: Neural machine translation by jointly learning to align and translate" (figure). In that paper, the attention vector is calculated through a feed-forward network, using the hidden states of the encoder and decoder as input (this is called "additive attention"). However, the schematic diagram of this section shows that the attention vector is calculated by using the dot product between the hidden states of the encoder and decoder (which is known as multiplicative attention). I believe that a short mention / clarification would be of benefit here.
