Thank you ematvey for this paper.

I wonder whether uw and us are two global weight vectors, or whether there is a different uw for each sentence and a different us for each document. From the code I think they are global vectors; am I right? Please help me confirm this.

As model_components.py says:

Performs task-specific attention reduction, using learned
attention context vector (constant within task of interest).

Both uw and us are defined in the function task_specific_attention(), and both refer to attention_context_vector:

attention_context_vector = tf.get_variable(name='attention_context_vector', shape=[output_size], initializer=initializer, dtype=tf.float32)

Since they come from the same definition, are they nevertheless different vectors in the computational graph? It would be helpful if you could explain a little about this part.

Thank you.

I believe you are right. uw and us are the global context vectors that store information about which words or sentences are most informative, respectively. They are learned during the training process.
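For illustration, here is a minimal sketch (not the repo's exact code) of how such a task-specific attention layer can be written so that the context vector is one learned, globally shared variable. It assumes TF1-style TensorFlow; the function and variable names mirror the snippet quoted above, while the projection layer and the scope names word_attention / sentence_attention are illustrative assumptions. The key point is that tf.get_variable creates one variable per variable scope, so calling the function once under a word-level scope and once under a sentence-level scope yields two distinct vectors in the graph, which is how uw and us end up as separate (but each globally shared) parameters.

import tensorflow as tf  # TF1-style graph API, matching the get_variable call above

def task_specific_attention(inputs, output_size,
                            initializer=tf.contrib.layers.xavier_initializer(),
                            scope=None):
    # inputs: [batch, time, input_size] encoder outputs.
    # Returns: [batch, output_size] attention-weighted reduction over time.
    with tf.variable_scope(scope or 'attention'):
        # One context vector per variable scope -- shared across ALL words
        # in all sentences (word level) or ALL sentences in all documents
        # (sentence level); it is NOT created per sentence or per document.
        attention_context_vector = tf.get_variable(
            name='attention_context_vector',
            shape=[output_size],
            initializer=initializer,
            dtype=tf.float32)

        # Project the encoder outputs into the attention space.
        input_projection = tf.contrib.layers.fully_connected(
            inputs, output_size, activation_fn=tf.tanh)

        # Similarity of every timestep to the shared context vector,
        # normalized over time into attention weights.
        vector_attn = tf.reduce_sum(
            input_projection * attention_context_vector, axis=2, keepdims=True)
        attention_weights = tf.nn.softmax(vector_attn, axis=1)

        # Attention-weighted sum over timesteps.
        return tf.reduce_sum(input_projection * attention_weights, axis=1)

# Two scopes => two distinct variables in the graph:
#   word_attention/attention_context_vector      (plays the role of uw)
#   sentence_attention/attention_context_vector  (plays the role of us)
# word_vecs = task_specific_attention(word_enc, 100, scope='word_attention')
# sent_vecs = task_specific_attention(sent_enc, 100, scope='sentence_attention')

Under these assumptions, both vectors are trained by backpropagation along with the rest of the model, exactly as the answer above describes.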