15 Dec 15:25

nSircombe

8830aff

Tool Solutions: ML Frameworks r25.12 Latest

Latest

PyTorch build

The pytorch-aarch64 r25.12 README.md is available here.
A Docker image is available on Docker Hub under armlimited/pytorch-arm-neoverse.
Attached as artefacts are:
- A full Software Bill of Materials (SBOM) as Tool-Solutions_r25.12_SPDX_SBOM.json, and;
- License texts as Tool-Solutions_r25.12_licenses.zip which cover all packages included in the built image, in addition to the base Ubuntu image.

Added

Adds oneDNN PR optimize brgemm.

Changed

Updates hashes for:
- PYTORCH_HASH to 93fef4bd1dd265588863929e35d9ac89328d5695, 2.10.0.dev20251124 from viable/strict, Nov 24th.
- IDEEP_HASH to 3724bec97a77ce990e8c6dc5e595bb3beee75257, from ideep_pytorch, Nov 24th.
- ONEDNN_HASH to 0b8a866c009b03f322e6526d7c33cfec84a4a97a, from main, Nov 25th.
- TORCH_AO_HASH to ab6bc89512d912c17a79ed8d4d709612d3e32884, from main, Nov 25th.
- KLEIDIAI_HASH to 94d6cc40689f44d308dbd57cb842e335fdd958f1, v1.17.0 from main, Nov 17th.

TensorFlow build

Added

Changed

Updates TensorFlow hash to 5d46b65af45d5694cb1676bc872d24a4a64a6b57, from nightly, Nov 25th

Note

Running tensorflow-aarch64/examples/run_unit_tests.sh with ONEDNN_DEFAULT_FPMATH_MODE=BF16 can lead to some failing tests. Addressing these failures is a WIP. However, all testing is expected to pass with the default setting ONEDNN_DEFAULT_FPMATH_MODE=FP32.

What's Changed

Bump hashes for r25.12 and patch PyTorch build by @puneetmatharu in #408
Bump actions/upload-artifact version by @puneetmatharu in #411
add conv optimizations by @Ryo-not-rio in #410
Updates CHANGELOGs for 25.12 release by @puneetmatharu in #412

Full diff: r25.11...r25.12

Contributors

puneetmatharu and Ryo-not-rio

Assets 4

20 Nov 10:26

puneetmatharu

r25.11

98012a0

Tool Solutions: ML Frameworks r25.11

PyTorch build

The pytorch-aarch64 r25.11 README.md is available here.
A Docker image is available on Docker Hub under armlimited/pytorch-arm-neoverse.
Attached as artefacts are:
- A full Software Bill of Materials (SBOM) as Tool-Solutions_r25.11_SPDX_SBOM.json, and;
- License texts as Tool-Solutions_r25.11_licenses.zip which cover all packages included in the built image, in addition to the base Ubuntu image.

Added

Adds PyTorch PR #167720, to allow missing cutlass file if CUDA disabled.
Adds oneDNN PR #4237, which improves cache blocking 4/8 inner blocks.
Adds PyTorch PR #167328 Build cpuinfo into c10 shared library.
Adds PyTorch PR #144992 Enable fp16 linear layers in PyTorch via ACL.

Changed

Now building ACL with PyTorch install_acl.sh script.
PyTorch manylinux AArch64 builder image from cpu-aarch64-a040006da76a51c4f660331e9abd3affe5a4bd81 to cpu-aarch64-d8be0384e085f551506bd739678109fa0f5ee7ac.
Updates hashes for:
- PYTORCH_HASH to 5ce4a8b49f9986b050a9f6fcc7dd4cf999baa509, 2.10.0.dev20251112 from viable/strict, Nov 12th.
- IDEEP_HASH to 927570638b237b0e39fb0626a868adffdbf70bbb, from ideep_pytorch, October 20th.
- ONEDNN_HASH to 80886d0559482dfe2019c2ae83eebd6d0d3a17d4, from main, Nov 9th.
- TORCH_AO_HASH to 17867e6788e4889b294449770f0275045384eab2, from main, Nov 8th.
- KLEIDIAI_HASH to 7bf4de9a56106f0fb0d57dfabeb4c7a2668deaf6, v1.16.0 from main, Nov 10th.
Replaced ACL_HASH=007264fa740de5723ebddef16b7bb3657692c088 with ACL_VERSION=v52.6.0, from main, October 31st.
Update torchvision from 0.25.0.dev20250923 to 0.25.0.dev20251104.

Removed

Removes patches which are no longer required.

TensorFlow build

Added

Adds TensorFlow PR #102272 to patch fixing CPUIDInfo initialization, re-enabling oneDNN+ACL path.

Changed

Updates TensorFlow hash to f40a10e064561ed66c03359c7751ccc5ae03dc4d, from nightly, Nov 10th.

Note

What's Changed

Separate PyTorch and TensorFlow CIs by @puneetmatharu in #390
Add TF CPUIDInfo fix patch by @davsva01 in #394
Enable fp16 linear layers in PyTorch via ACL by @renato-arantes in #396
[Feat] Improve CI by @Radu2k in #395
Update bert large model in examples/answer_questions.py by @puneetmatharu in #398
Add PyTorch PR #167328 patches: build cpuinfo into c10 shared library by @murste01 in #399
Patch build and update PyTorch/TensorFlow hashes by @puneetmatharu in #400
Delete (now unused) utils/build_openblas.sh by @puneetmatharu in #403
Remove third_party/cutlass from torchao by @puneetmatharu in #404
Patch setting of oneDNN fpmath mode in PyTorch CI by @puneetmatharu in #405
Patch quantization in PyTorch examples by @puneetmatharu in #401
Remove PyTorch float16 testing with bf16 fpmath mode by @puneetmatharu in #407
Updates CHANGELOGs for 25.11 release by @puneetmatharu in #402

New Contributors

@Radu2k made their first contribution in #395

Full Changelog: r25.10...r25.11

Contributors

Radu2k, puneetmatharu, and 3 other contributors

Assets 4

15 Oct 22:58

nSircombe

r25.10

f15faf1

Tool Solutions: ML Frameworks r25.10

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.09/ML-Frameworks/pytorch-aarch64/README.md
Docker image available from: https://hub.docker.com/r/armlimited/pytorch-arm-neoverse
A full Software Bill of Materials (Tool-Solutions_r25.10_SPDX_SBOM.json) and license texts (Tool-Solutions_r25.10_licenses.zip) , covering all packages added to the base Ubuntu 22.04 image are attached as artefacts.

Added

Adds PyTorch PR to enable mimalloc by default on AArch64 build:
- Yields ~28% and ~20% mean improvement for fp32 and bf16 respectively.

Removed

Completely removes unnecessary python packages, including setuptools, from PyTorch Docker image.
Delete unused submodules of PyTorch's third-party modules.
Removed all references to tcmalloc as the default build now uses mimalloc.

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.09/ML-Frameworks/tensorflow-aarch64/README.md

Removed

Removes protobuf packages from Docker image that are not required for running tensorflow. Mitigation for CVE-2025-4565.
Removes system level python packages that aren't used.

Fixed

Updates Transformers to 4.53.0 as a mitigation for: CVE-2025-6051, CVE-2025-3933, CVE-2025-6638, CVE-2025-3262, CVE-2025-3263, CVE-2025-3264, CVE-2025-5197, CVE-2025-6921, CVE-2025-3777.
Updates huggingface to 0.34.0 for compatibility with updated Transformers.
Updates pip to >= 25.2 due to CVE-2025-8869 and CVE-2023-5752.
Updates wheel to >= 0.38.0 due to CVE-2022-40898.
Updates setuptools to >= 78.1.1 due to CVE-2024-6345, CVE-2022-40897 and CVE-2025-47273.

Full list of PRs

Correct torch version in CHANGELOG.md by @puneetmatharu in #381
Prune setuptools from PyTorch Docker image. by @puneetmatharu in #384
Patch comment in get-source.sh by @puneetmatharu in #385
Add 'utils' folder to precommit triggers by @puneetmatharu in #386
Upgrade packages found in Docker image and remove unused. by @almayne in #387
Add mimalloc support by @puneetmatharu in #388
pytorch: remove some unused transitive deps by @jondea in #389
Tidy up README and remove tcmalloc references. by @puneetmatharu in #391
Updates CHANGELOGS for 25.10 release by @nSircombe in #392

Full diff

r25.09...r25.10

Contributors

jondea, puneetmatharu, and 2 other contributors

Assets 4

30 Sep 10:04

nSircombe

r25.09

1fc6820

Tool Solutions: ML Frameworks r25.09

What's Changed

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.09/ML-Frameworks/pytorch-aarch64/README.md

Changed

Updates ACL URL from ML Platform to GitHub.
Updates hashes for:
- PYTORCH_HASH to 45d9dcccc5bfb46a9eaca037270278bc3d7c95ea, 2.10.0.dev20250923 from viable/strict, September 23rd.
- IDEEP_HASH to fd11055f4800ac89291e30b5387a79a1e6496aa6, from ideep_pytorch, September 10th.
- ONEDNN_HASH to 9e8f619477469ed75d323d4915bf7a2513f01713, from main, September 23rd.
- ACL_HASH to 531a4968cecb7b4fc0a3b65482e2c524289e087e, from main, September 23rd.
- TORCH_AO_HASH to 8e2ca35ea603349e71c2467e10fd371e34bf52bc, from main, September 23rd.
- KLEIDIAI_HASH to bd2e6ae060014035e25bf4986be682762c446c2d, v1.14 from main.
Update torchvision from 0.23.0 to a nightly build, 0.25.0.dev20250923
Change of flag name in ./build.sh from --force to --fresh
Add intx_packing_format="opaque_aten_kleidiai" to Int8DynamicActivationIntxWeightConfig due to torchao API change
Updates dockerize.sh build to use BuildKit (to add extra build contexts).

Removed

Removes WIP ComputeLibrary patch https://review.mlplatform.org/c/ml/ComputeLibrary/+/12818/1.
Removed unused dependencies:
- Delete unused submodules to avoid fetching.
- Replace google-perftool bundle with specific libtcmalloc-minimal4 and add symlink so that you can still use it with LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4.
- Remove unused git apt package.
- Remove outdated versions of pip/setuptools/wheel from image after pip updates itself.
- Remove manual install of transformers in transformers_llm_text_gen.py, we now use a single version everywhere.
- Remove unused pip packages: datasets, omegaconf, psutil, tiktoken, torchaudio, torchdata and torchtune.

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.09/ML-Frameworks/tensorflow-aarch64/README.md

Added

Adds WIP patch to upgrade Compute Library to 52.4.0
Adds SECURITY.md to built image

Changed

Updates TensorFLow hash to 6aa8fd07270293b918255af5988aef45b844c5b4 # from nightly, September 22nd
Updates dockerize.sh build to use BuildKit (to add extra build contexts)
Change of flag name in ./build.sh from --force to --fresh

Removed

Removed previous patch updating oneDNN and Compute Library that has been merged upstream

Full list of PRs

add tensorflow ci by @Ryo-not-rio in #356
Copy torchao wheel to results/ directory by @puneetmatharu in #362
Update ComputeLibrary info in get-source.sh by @puneetmatharu in #361
Reduce dependecies by @jondea in #364
Remove more deps by @jondea in #365
Bump tensorflow version and update patches. by @almayne in #366
Bumps hashes for PyTorch build ahead of 25.09 release. by @puneetmatharu in #367
Move SECURITY.md to the top-level directory and tweak wording by @puneetmatharu in #369
Add file exclusions to precommit.yml by @puneetmatharu in #368
Make build.sh wipe old artefacts with --fresh by @puneetmatharu in #370
pytorch: fix perf issue from torchao API change by @jondea in #372
Add SECURITY.md to image by @puneetmatharu in #374
Adds r25.09 tag to CHANGELOGs by @nSircombe in #373
Patch clean-up for PyTorch and TensorFlow by @puneetmatharu in #375
Patch tensorflow docker build by @puneetmatharu in #376
Patch failing TensorFlow build in CI by @puneetmatharu in #379
Updated patch requires update to git hash. by @almayne in #378
pytorch: fix quality of vision instruct by using version=1. by @jondea in #380

New Contributors

@puneetmatharu made their first contribution in #362
@almayne made their first contribution in #366

Full diff

r25.08...r25.09

Contributors

jondea, puneetmatharu, and 3 other contributors

Assets 2

29 Aug 09:36

nSircombe

r25.08

95a3864

Tool Solutions: ML Frameworks r25.08

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.08/ML-Frameworks/pytorch-aarch64/README.md

Added

Adds pytorch/pytorch#159859, a WIP LUT implmentation of bf16 GELU
this gives an ~8x speedup on GELU and an ~1.8x speedup for attention for llama3.2 11B Vision (both on 16 threads).
Adds pytorch/pytorch#158250, to integrate INT4->BF16 via KleidiAI, with fallback.
Adds pytorch/pytorch#160080, a VLA PoC for PyTorch, and
pytorch/pytorch#161049, an optimised SVE exp_u20 implementation,
Note: there may be some regressions on Neoverse-V1 with this WIP patch.
Adds a new example script llama_vision_instruct.py to run and benchmark
Llama-3.2-11B-Vision-Instruct using text + image input and text output.
- Using Llama-3.2-11B-Vision-Instruct gives TTFT < 8s and a decode throughput of >16 tokens/s and with the base model, TTFT < 3.5s and throughput is > 20 tokens/s.
Temporarily removed --compile flag from transformers_llm_text_gen.py example as there is an unresolved issue with torch compile support in this case.

Changed

Updates hashes for:
- PYTORCH_HASH to 4e2ddb5db67617f9f5309c8bba0c17adc84cadbc, 2.9.0.dev20250808 from viable/strict, August 8th.
- IDEEP_HASH to 3527b0bf2127aa2de93810feb6906d173c24037f, from ideep_pytorch, August 1st.
- ONEDNN_HASH to 7e85b94b5f6be27b83c5435603ab67888b99da32, from main, August 1st.
- ACL_HASH to 3c32d706d0245dcb55181c8ced526eab05e2ff8d, from main, August 1st.
- TORCH_AO_HASH to 8d4a5d83d7be4d7807feabe38d37704c92d40900, from main, August 1st.
- KLEIDIAI_HASH to 8ca226712975f24f13f71d04cda039a0ee9f9e2f, v1.12 from main.
Update the examples/transformers_llm_text_gen.py to use the new quantizer api Int8DynamicActivationIntxWeightConfig.
Deleted torchchat_llm_text_gen.py.
Removed Dockerfile lines cloning TorchChat repo and setting safe.directory.
Updates huggingface_hub to 0.34.0.

Removed

pytorch/pytorch#151547, to update OpenBLAS commit as this has been merged upstream.

Fixed

Updates various Python packages to address known vulnerabilies with a high CVSS score:
- Updates Transformers to 4.55.2, this also provides a mitigation for CVE-2025-2099.
- Updates Wheel version to 0.38.0 as a mitigation for CVE-2022-40898.
- Updates setup-tools to 78.1.1 as a mitigation for CVE-2025-47273 and CVE-2024-6345.
- Updates Torchvison to 0.23 to avoid the need to use --extra-index-url
  this is the recomended mitgation against CVE-2018-20225, affecting all versions of pip.

Known issues

transformers_llm_text_gen.py example does not currently work in compiler mode.
Sample image downloads from https://upload.wikimedia.org may encounter a 403 response. The sample image can be downloaded separately via curl if needed.

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.08/ML-Frameworks/tensorflow-aarch64/README.md

Changed

Updates TensorFlow hash to ab8aab720f1648f6a470b159b0d1aea3a5b0df81, 2.20.0-dev0 from master, 25th July 2025

Fixed

Updates Transformers to 4.50 as a mitigation for CVE-2025-2099.

Full list of PRs

Add smoke tests to precommit ci by @Ryo-not-rio in #341
build: update TF hashes by @Sqvid in #346
fix: correct typo in stderr redirect by @Sqvid in #347
[Update] Update new quantizer API usage, bump TorchAO version, and remove torchchat. by @gausah-arm in #348
Bumps hashes for PyTorch build ahead of 25.08 release. by @nSircombe in #349
Add unit tests to ci by @Ryo-not-rio in #345
pytorch: re-pin and reduce build pip requirements by @jondea in #351
pytorch: fix dockerize and deduplicate build logic by @jondea in #352
onednn - add brgemm bf16 patch by @aditew01 in #354
Adds PyTorch patch to implement bf16 GELU via LUT. by @nSircombe in #353
r25.08 updates by @nSircombe in #355
Updates requirements to add setuptools back by @nSircombe in #357
Minor corrections to CHANGELOG and examples README by @nSircombe in #358
Updates to CHANGELOG and docs. by @nSircombe in #359

Full diff: r25.07...r25.08

Contributors

jondea, nSircombe, and 4 other contributors

Assets 2

14 Jul 14:36

nSircombe

r25.07

df2eceb

Tool Solutions: ML Frameworks r25.07

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.07/ML-Frameworks/pytorch-aarch64/README.md

Changed

Updates Dockerfile to minimise layers in the final image.
Updates hashes for:
- PyTorch to 5dfd8a9c7a464bb42e81b8594eefd2fa865e5423, to 2.9.0.dev20250703 from viable/strict
- ideep to 6eb12eaad5e0f7d8c8613c744ac8ba5a0843cb99, from ideep_pytorch, July 3rd
- oneDNN to 0abfca1947b53c03ee74207e4710941ab6456f3b, from main, July 3rd
- Compute Library to f69b48afcc59f1b3b0d4544289249bebba489f0a, from main, June 26th

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.07/ML-Frameworks/tensorflow-aarch64/README.md

Changed

Updates Dockerfile to minimise layers in the final image.
Updates TensorFlow hash to 70d137e8794f97728e094608ec9ea1c6c9fd42af, 2.20.0-dev0 from nightly, July 1st

Full list of PRs

Bump PyTorch & Tensorflow versions for 25.07 release; update PR commits where necessary by @murste01 in #338
Add initial precommit ci by @Ryo-not-rio in #340
Updates for 25.07 release. by @nSircombe in #339
Fixes missing docker build arg in tensorflow build. by @nSircombe in #342
Removes unwanted DOCKER_BUILDKIT=1 setting by @nSircombe in #343

Full Diff: r25.06...r25.07

Contributors

nSircombe, Ryo-not-rio, and murste01

Assets 2

23 Jun 08:41

nSircombe

r25.06

8f49d77

Tool Solutions: ML Frameworks r25.06

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.06/ML-Frameworks/pytorch-aarch64/README.md

Added

Adds WIP patch to update the OpenBLAS commit - pytorch/pytorch#151547
- Speedups across most of the HF models via significant boost to SDPA layers.
- Overall torch-bench pass-rate increased
Adds support in github-apply-patch for local caching of patches.

Changed

Updates protobuf from 5.29.2 to 5.29.5 in response to CVE-2025-4565
Updates hashes for:
- PyTorch to 3040ca6d0f8558e39919b14eebeacc34ddf980f5 2.8.0.dev20250611 from viable/strict
- ideep to 2ef932a861439e4cc9bb8baee8424b57573de023 from ideep_pytorch, June 10.
- oneDNN to 106a7b41bc4156297b8a88cd1951304b739cc427 form main, June 10th
- Compute Library to 6bc1c7b8d0756272e2a97a7489e13de90f864326 from main, June 9th

Removed

Removes WIP patches which have now landed in the upstream nightly PyTorch builds.

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.06/ML-Frameworks/tensorflow-aarch64/README.md

Added

Adds support in github-apply-patch for local caching of patches.
Adds WIP patch to tensorflow/tensorflow#93951
to update to oneDNN-3.7 + ACL-24.12, and address unit test failures

Changed

Updates TensorFlow hash to 65781570c55d2338106767de200323f123c3f91f
tensorflow-2.20.0.dev0 from main

Complete List of PRs

Bump commits and update patches for 25.06 by @murste01 in #332
Bump protobuf from 5.29.2 to 5.29.5 in /ML-Frameworks/pytorch-aarch64 by @dependabot in #333
Re-add patch which pins PyTorch requirements by @jondea in #334
Add Update OpenBLAS Commit patch to PyTorch by @murste01 in #336
Add patch cache to github-apply-patch by @jondea in #335
Update CHANGELOGs for 25.06 release. by @nSircombe in #337

New Contributors

@murste01 made their first contribution in #332

Full Diff: r25.05...r25.06

Contributors

jondea, dependabot, and 2 other contributors

Assets 2

16 May 11:50

nSircombe

r25.05

58b44ac

Tool Solutions: ML Frameworks r25.05

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.05/ML-Frameworks/pytorch-aarch64/README.md

Added

WIP patch to build a newer version of libgomp from source - pytorch/pytorch#152361
Improve scaling for >16 threads.
Support for authenticated GitHub access in apply-github-patch

Changed

Updates hashes for:
- PyTorch e872bf8f888bdbb27a03e03935db61babf7180b8, 2.8.0.dev20250430 from viable/strict
- ideep to 2ef932a861439e4cc9bb8baee8424b57573de023 from ideep_pytorch
- oneDNN to 69150ce5fe1f453af9125ca42a921e017092ccf7 from main
- Compute Library to 334108c0efc512efdc9576ba957dbcf5b7ee168a, rc_25_04_29_0
Updates existing WIP patches.

Removed

Removes WIP patches which have now landed in the upstream nightly PyTorch builds.

Fixed

Reverted this commit to add back --compile option. The issue was fixed upstream and is included in the bumped hash

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.05/ML-Frameworks/tensorflow-aarch64/README.md

Added

Support for authenticated GitHub access in apply-github-patch

Complete List of PRs

Bump sources for 25.05 by @karmeh01 in #325
Revert "Removes compile option from genai PyTorch examples" by @karmeh01 in #326
Modify CHANGELOG.md to update unreleased section by @karmeh01 in #327
Update tag naming scheme in CHANGELOGs by @nSircombe in #324
Adds patch to build libgomp from sources by @nSircombe in #328
Add option for auth. GitHub access and update CHANGELOGS for 25.05 by @nSircombe in #329
Catches unbound GITHUB_TOKEN. by @nSircombe in #330
Sets r25.05 release date. by @nSircombe in #331

New Contributors

@karmeh01 made their first contribution in #325

Full Changelog: r25.04...r25.05

Contributors

nSircombe and karmeh01

Assets 2

22 Apr 17:32

nSircombe

r25.04

95928c1

Tool Solutions: ML Frameworks r25.04

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.04/ML-Frameworks/pytorch-aarch64/README.md

Added

Work in progress oneDNN patch, Enable jit conv for 128 with ~30% speed up for backward convolutions
Add --wheel-only flag for only building the torch wheel
Work in progress PyTorch patch for pinning the major versions of build dependencies - pytorch/pytorch#150833

Changed

Updates hashes for:
- PyTorch fc674b45d4d8edfd4c630d89f71ea9f85a2f61f2, 2.8.0.dev20250403 from viable/strict
- ideep to 719d8e6cd7f7a0e01b155657526d693acf97c2b3 from ideep_pytorch
- oneDNN to 5de25f354afee38bf2db61f485c729d30f62c611 from main
- Compute Library to 9033bdacdc3840c80762bc56e8facb87b0e1048e, 25.03 release
- OpenBLAS to edef2e4441e50e3a2da1920fdbde09101087c43d from main
Updates existing WIP patches.
Update torchvision from 0.22.0.dev20250305 to a more recent nightly build, 0.22.0.dev20250403

Removed

Removes WIP patches which have now landed in the upstream nightly PyTorch builds.
Removes --tags --force from git clone command, and adds --depth=1 to speedup the checkout.
Temporarily removes --compile option from some examples due to an issue with pytorch/pytorch#147151
the compile path does not work as expected in these cases.

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.04/ML-Frameworks/tensorflow-aarch64/README.md

Added

Enables patching of build outside of Bazel build.
default num_threads to max for acl_threadpool, see www.github.com/tensorflow/uxlfoundation/oneDNN/2958

Changed

Updates TensorFlow build to use oneDNN 3.7 + ACL 24.12, see www.github.com/tensorflow/tensorflow/pull/84975

Complete list of Pls

fix: typo in stdout to stderr redirection by @Sqvid in #312
Bump hashes ahead of r25.04 release. by @nSircombe in #313
Adds back BF16 SVE intrinsics via new patch by @nSircombe in #314
simplify apply-github-patch to implicitly use GitHub URL by @Sqvid in #315
Update torchvision version and PyTorch hash to fix vision test failure. by @nSircombe in #316
add common dirs and files to gitignore by @Sqvid in #317
Add oneDNN sve_128 jit conv patch for PyTorch by @jondea in #319
build TensorFlow against + oneDNN 3.7 + ACL 24.12 by @Sqvid in #318
More dev improvements by @jondea in #320
Pin major versions of PyTorch build deps by @jondea in #321
update pytorch src's by @aditew01 in #322
Update examples and CHANGELOG for r25.04 by @nSircombe in #323

New Contributors

@Sqvid made their first contribution in #312

Full Diff: r25.03.1...r25.04

Contributors

jondea, nSircombe, and 2 other contributors

Assets 2

27 Mar 08:24

nSircombe

r25.03.1

a8998a9

Tool Solutions: ML Frameworks r25.03.1

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.03.1/ML-Frameworks/pytorch-aarch64/README.md

Changed

Move torch* packages to latest stable to fix build error
- torchaudio==2.6.0.dev20250305 -> 2.6.0
- torchdata~=0.7.1 -> 0.11.0
- torchvision~=0.22.0.dev20250305 -> 0.21.0
- torchtune==0.6.0.dev20250115 -> 0.5.0

Fixed

Fix build error due to missing torchtune nightly build
Fix build error due to auditwheel version change by applying commits from pytorch/pytorch#149471

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.03.1/ML-Frameworks/tensorflow-aarch64/README.md
No changes to the TensorFlow build with this release.

Complete list of PRs

Fix missing torch* nightlies by using latest stable by @jondea in #307
Fix build error due to auditwheel version change by @jondea in #308
Update CHANGELOGs for patch release. by @nSircombe in #309
Add missing deps for torchchat example by @jondea in #310
Update CHANGELOGs for patch release. by @nSircombe in #311

Full Diff: r25.03...r25.03.1

Contributors

jondea and nSircombe

Assets 2

Releases: ARM-software/Tool-Solutions

Tool Solutions: ML Frameworks r25.12

PyTorch build

Added

Changed

TensorFlow build

Added

Changed

Note

What's Changed

Contributors

Uh oh!

Tool Solutions: ML Frameworks r25.11

PyTorch build

Added

Changed

Removed

TensorFlow build

Added

Changed

Note

What's Changed

New Contributors

Contributors

Uh oh!

Tool Solutions: ML Frameworks r25.10

PyTorch build

Added

Removed

TensorFlow build

Removed

Fixed

Full list of PRs

Full diff

Contributors

Uh oh!

Tool Solutions: ML Frameworks r25.09

What's Changed

PyTorch build

Changed

Removed

TensorFlow build

Added

Changed

Removed

Full list of PRs

New Contributors

Full diff

Contributors

Uh oh!

Tool Solutions: ML Frameworks r25.08

PyTorch build

Added

Changed

Removed

Fixed

Known issues

TensorFlow build

Changed

Fixed

Full list of PRs

Contributors

Uh oh!

Tool Solutions: ML Frameworks r25.07

PyTorch build

Changed

TensorFlow build

Changed

Full list of PRs

Contributors

Uh oh!

Tool Solutions: ML Frameworks r25.06

PyTorch build

Added

Changed

Removed

TensorFlow build

Added

Changed

Complete List of PRs