Fix typo error for lstm operations #385
Conversation
index.bs
Outdated
```diff
 - *bias*: an {{MLOperand}}. The 1-D input bias tensor of shape [4 * hidden_size]. The ordering of the bias vectors in the first dimension of the tensor shape is specified according to the *options.layout* argument.
 - *recurrentBias*: an {{MLOperand}}. The 1-D recurrent bias tensor of shape [4 * hidden_size]. The ordering of the bias vectors in the first dimension of the tensor shape is specified according to the *options.layout* argument.
- - *peepholeWeight*: an {{MLOperand}}. The 1-D weight tensor for peepholes of shape [3 * hidden_size]. The pack ordering of the weight vectors is for the *input (i)*, *output (o)*, and *forget (f)* gate respectively.
+ - *peepholeWeight*: an {{MLOperand}}. The 1-D weight tensor for peepholes of shape [4 * hidden_size]. The pack ordering of the weight vectors is for the *input (i)*, *output (o)*, and *forget (f)* gate respectively.
```
I'm not sure about TF and PT (I can't figure out yet which one corresponds to the "peephole", since they evidently use a different term), but the DML API definitely shows `PeepholeTensor` sizes = { 1, 1, num_directions, 3 * hidden_size }, and ONNX does too: "P (optional, differentiable): The weight tensor for peepholes. ... It has shape [num_directions, 3*hidden_size]" — not 4?
- https://www.tensorflow.org/api_docs/python/tf/keras/layers/LSTM
- https://pytorch.org/docs/stable/generated/torch.nn.LSTM.html
- https://learn.microsoft.com/en-us/windows/win32/api/directml/ns-directml-dml_lstm_operator_desc
- https://github.com/onnx/onnx/blob/main/docs/Operators.md#lstm
That would be consistent with the wording after it, concatenating 3 tensors: The pack ordering of the weight vectors is for the *input (i)*, *output (o)*, and *forget (f)* gate respectively.
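That packing can be sketched in plain JavaScript (a hypothetical `packPeepholeWeights` helper, with plain arrays standing in for MLOperands — not the spec's API):

```javascript
// Sketch, assuming plain arrays: pack the three per-gate peephole vectors
// (input i, output o, forget f) into one 1-D tensor of length
// 3 * hidden_size. The cell/candidate gate has no peephole connection,
// which is why the factor is 3 rather than the 4 used for weights/biases.
function packPeepholeWeights(pi, po, pf) {
  const hiddenSize = pi.length;
  if (po.length !== hiddenSize || pf.length !== hiddenSize) {
    throw new Error('each per-gate peephole vector must have hidden_size elements');
  }
  return [...pi, ...po, ...pf]; // pack ordering: i, o, f
}

const packed = packPeepholeWeights([0.1, 0.2], [0.3, 0.4], [0.5, 0.6]);
console.log(packed.length); // 6, i.e. 3 * hidden_size with hidden_size = 2
```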
Thanks @fdwr!
I've updated the commit, referring to the materials you shared above and also the arXiv paper LSTM: A Search Space Odyssey, and updated the lstm implementation for the WebNN-Polyfill API (webmachinelearning/webnn-polyfill@b25e817). Please take another look, thanks.
In the spec, the lstm operation describes peepholeWeight as follows:
> peepholeWeight: an [MLOperand](https://webmachinelearning.github.io/webnn/#mloperand). The 2-D weight tensor for peepholes of shape [num_directions, 4 * hidden_size]. The pack ordering of the weight vectors is for the input (i), output (o), and forget (f) gate respectively.
So I originally modified the lstmCell operation to align with the second value of the shape above, [num_directions, 4 * hidden_size].
However, the peepholeWeight shape [num_directions, 4 * hidden_size] is itself the typo; it should be [num_directions, 3 * hidden_size]. That matches the explanation on the next line: "The pack ordering of the weight vectors is for the input (i), output (o), and forget (f) gate respectively."
The shape [num_directions, 3 * hidden_size] is also confirmed by the much clearer description in part II (VANILLA LSTM) of the paper LSTM: A Search Space Odyssey; please see these three screenshots:
- figure-1
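For reference, the vanilla LSTM forward pass with peepholes can be written as below (standard notation: $\sigma$ is the logistic sigmoid, $\odot$ is the elementwise product, and $p_i$, $p_f$, $p_o$ are the peephole vectors, each of length hidden_size). Only the input, forget, and output gates carry a peephole term, hence 3 * hidden_size:

$$
\begin{aligned}
g_t &= \tanh(W_g x_t + R_g y_{t-1} + b_g) \\
i_t &= \sigma(W_i x_t + R_i y_{t-1} + p_i \odot c_{t-1} + b_i) \\
f_t &= \sigma(W_f x_t + R_f y_{t-1} + p_f \odot c_{t-1} + b_f) \\
c_t &= i_t \odot g_t + f_t \odot c_{t-1} \\
o_t &= \sigma(W_o x_t + R_o y_{t-1} + p_o \odot c_t + b_o) \\
y_t &= o_t \odot \tanh(c_t)
\end{aligned}
$$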
> but the DML API definitely shows `PeepholeTensor` sizes = { 1, 1, num_directions, 3 * hidden_size }, and ONNX too: "P (optional, differentiable): The weight tensor for peepholes. ... It has shape [num_directions, 3*hidden_size]"
At first, [3 * hidden_size] appeared not to work as the peepholeWeight shape for the lstm operation: it raised an error when executing slice compute, while [3 * hidden_size] worked for the lstmCell operation. The slice compute error turned out to be caused by not updating the currentPeepholeWeight slice size inside the for loop, as below:
```diff
 for (let dir = 0; dir < numDirections; ++dir) {
   .....
   currentPeepholeWeight.push(options.peepholeWeight ?
-    (builder.squeeze(builder.slice(options.peepholeWeight, [dir, 0], [1, 4 * hidden_size]), { axes: [0] })) : null);
+    (builder.squeeze(builder.slice(options.peepholeWeight, [dir, 0], [1, 3 * hidden_size]), { axes: [0] })) : null);
 }
```
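The failure mode can be sketched in plain JavaScript (a hypothetical `sliceRow` helper, with nested arrays standing in for the polyfill's MLOperands): each row of a [numDirections, 3 * hidden_size] tensor is only 3 * hidden_size wide, so requesting a slice of width 4 * hidden_size must fail.

```javascript
// Sketch of the per-direction slice + squeeze from the loop above,
// assuming nested arrays rather than MLOperands.
function sliceRow(tensor2d, dir, width) {
  const row = tensor2d[dir];
  if (width > row.length) {
    throw new Error(`slice width ${width} exceeds dimension ${row.length}`);
  }
  return row.slice(0, width); // "squeeze": return the row as a 1-D vector
}

const hiddenSize = 2;
const peephole = [[1, 2, 3, 4, 5, 6]]; // numDirections = 1, 3 * hiddenSize = 6
console.log(sliceRow(peephole, 0, 3 * hiddenSize)); // [1, 2, 3, 4, 5, 6]
// sliceRow(peephole, 0, 4 * hiddenSize) would throw, matching the error above.
```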
Oof, you had to dig back into the original paper. Thank you for the spec wording and pseudocode fix.
Force-pushed from 006af72 to 1732b82.
wchao1115 left a comment:
Looks right. Thanks for fixing it.
@huningxin feel free to merge at will if you're happy with this fix.

@BruceDai What is the reason this PR hasn't been merged? It was approved a few weeks ago.

@anssiko @huningxin Would you please help merge this PR? Thanks.

@BruceDai thanks for this contribution!



The fix is verified by the webnn-polyfill lstm and lstmCell implementations and tests: webmachinelearning/webnn-polyfill#227.
@wchao1115 @fdwr @huningxin PTAL, thanks.