Attention Model

Motivation:

The motivation of attention model comes from the fact that the encoder/decoder of RNN needs to store the entire sentence in the encoder before decoder can make the prediction.

So, typically encoder/decoder performs well on the short sentence, but the BLUE score descreases as the sentence length get larger.

Attention model intuition

Attention model

is the amount of ‘attention’ should pay to

Computing attention