The comment `# output : [batch_size, len_seq, n_hidden]` should indeed be corrected to `# output : [batch_size, len_seq, n_hidden*2]`, because the model is a bidirectional LSTM. A Bi-LSTM concatenates the forward and backward hidden states along the feature dimension, so the output feature size is effectively doubled. After the permutation, the output shape is therefore `[batch_size, len_seq, n_hidden * 2]`.
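
A minimal sketch to verify this, assuming a plain PyTorch `nn.LSTM` and made-up dimensions (the names `n_hidden`, `len_seq`, etc. just follow the convention in the original comment):

```python
import torch
import torch.nn as nn

# Illustrative dimensions only
batch_size, len_seq, emb_dim, n_hidden = 4, 10, 8, 16

# Bidirectional LSTM: output feature size becomes n_hidden * 2
lstm = nn.LSTM(input_size=emb_dim, hidden_size=n_hidden, bidirectional=True)

# Default (batch_first=False) input layout: [len_seq, batch_size, emb_dim]
x = torch.randn(len_seq, batch_size, emb_dim)

output, (h_n, c_n) = lstm(x)
print(output.shape)               # torch.Size([10, 4, 32]) -> [len_seq, batch_size, n_hidden*2]

output = output.permute(1, 0, 2)  # -> [batch_size, len_seq, n_hidden*2]
print(output.shape)               # torch.Size([4, 10, 32])
```

The last dimension is `32 = n_hidden * 2` rather than `n_hidden`, which is exactly why the comment needs the `*2`.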