Attention weights #3

PietroAmin · 2020-04-22T19:49:29Z

In seq2seq.py the attention weights are computed like this:

attn_weights = F.softmax(
            self.attn(F.concat(embedded, hidden[0].flatten(), dim=1)))

Where embedded is the input of the decoder and hidden is the encoder's hidden as in the train you define hidden as: decoder_hidden = encoder_hidden. The problem is that as I found online in different sources the attention weights are computed with decoder's hidden and encoder's output.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Attention weights #3

Attention weights #3

PietroAmin commented Apr 22, 2020

Attention weights #3

Attention weights #3

Comments

PietroAmin commented Apr 22, 2020