Cortex-M backend: Add quantized int8 batch matmul (CMSIS-NN) by rascani · Pull Request #17799 · pytorch/executorch

rascani · 2026-03-02T22:31:40Z

Summary

Add cortex_m::quantized_batch_matmul wrapping arm_batch_matmul_s8. The RHS is always pre-transposed: constant RHS (parameters) are transposed at AOT time in the pass, dynamic RHS get a cortex_m::transpose node inserted in the graph.

It would be preferable if we could pre-compute or cache the constant RHS kernel sums, but I could not find any public CMSIS-NN APIs that would allow us to do so.

Fixes #16109

Authored with Claude.

Test plan

pytest backends/cortex_m/test/ops/test_batch_matmul.py

Add cortex_m::quantized_batch_matmul wrapping arm_batch_matmul_s8. The RHS is always pre-transposed: constant RHS (parameters) are transposed at AOT time in the pass, dynamic RHS get a cortex_m::transpose node inserted in the graph. Authored with Claude.

pytorch-bot · 2026-03-02T22:31:46Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17799

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Awaiting Approval, 3 New Failures

As of commit cb40758 with merge base 25f2a3f ():

AWAITING APPROVAL - The following workflow needs approval before CI can run:

periodic (gh)

NEW FAILURES - The following jobs have failed:

Build Presets / linux (llm, linux.2xlarge, executorch-ubuntu-22.04-clang12) / build (gh)
pull / test-samsung-models-linux / linux-job (gh)
RuntimeError: Command docker exec -t 468eaf869ee411154b4e2fdc2282e32d0e5d85d49fe6045cb770b7f15d0f16d6 /exec failed with exit code 1
pull / test-samsung-quantmodels-linux / linux-job (gh)
RuntimeError: Command docker exec -t e6f72e580c2392368c765f927d59fdfe31bf390b763d9dabf95b2a7cb7245c6b /exec failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-03-02T22:32:25Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

rascani requested review from AdrianLundell and psiddh March 2, 2026 22:31

rascani requested review from kirklandsign and larryliu0820 as code owners March 2, 2026 22:31

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cortex-M backend: Add quantized int8 batch matmul (CMSIS-NN)#17799

Cortex-M backend: Add quantized int8 batch matmul (CMSIS-NN)#17799
rascani wants to merge 1 commit intopytorch:mainfrom
rascani:cortex-m-batch-mm

rascani commented Mar 2, 2026

Uh oh!

pytorch-bot bot commented Mar 2, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rascani commented Mar 2, 2026

Summary

Test plan

Uh oh!

pytorch-bot bot commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17799

❌ 1 Awaiting Approval, 3 New Failures

Uh oh!

github-actions bot commented Mar 2, 2026

This PR needs a release notes: label

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

pytorch-bot bot commented Mar 2, 2026 •

edited

Loading

This PR needs a `release notes:` label