Releases: ARM-software/Tool-Solutions

Tool Solutions: ML Frameworks r25.12

15 Dec 15:25
8830aff

PyTorch build

  • The pytorch-aarch64 r25.12 README.md is available here.
  • A Docker image is available on Docker Hub under armlimited/pytorch-arm-neoverse.
  • Attached as artefacts are:
    • A full Software Bill of Materials (SBOM) as Tool-Solutions_r25.12_SPDX_SBOM.json; and
    • License texts as Tool-Solutions_r25.12_licenses.zip, which cover all packages included in the built image, in addition to the base Ubuntu image.

Added

  • Adds oneDNN PR optimize brgemm.

Changed

  • Updates hashes for:
    • PYTORCH_HASH to 93fef4bd1dd265588863929e35d9ac89328d5695, 2.10.0.dev20251124 from viable/strict, Nov 24th.
    • IDEEP_HASH to 3724bec97a77ce990e8c6dc5e595bb3beee75257, from ideep_pytorch, Nov 24th.
    • ONEDNN_HASH to 0b8a866c009b03f322e6526d7c33cfec84a4a97a, from main, Nov 25th.
    • TORCH_AO_HASH to ab6bc89512d912c17a79ed8d4d709612d3e32884, from main, Nov 25th.
    • KLEIDIAI_HASH to 94d6cc40689f44d308dbd57cb842e335fdd958f1, v1.17.0 from main, Nov 17th.
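
Each pin in the list above is a full 40-character Git commit hash. As a minimal sketch, assuming the pins are available as plain strings, the check below flags any value that is truncated or mistyped before it is used in a build. The names `PINS`, `is_full_commit_hash` and `invalid_pins` are illustrative and are not part of the repository's build scripts.

```python
import re

# Pins copied from the r25.12 "Updates hashes for" list above.
PINS = {
    "PYTORCH_HASH": "93fef4bd1dd265588863929e35d9ac89328d5695",
    "IDEEP_HASH": "3724bec97a77ce990e8c6dc5e595bb3beee75257",
    "ONEDNN_HASH": "0b8a866c009b03f322e6526d7c33cfec84a4a97a",
    "TORCH_AO_HASH": "ab6bc89512d912c17a79ed8d4d709612d3e32884",
    "KLEIDIAI_HASH": "94d6cc40689f44d308dbd57cb842e335fdd958f1",
}

# A full SHA-1 commit id is exactly 40 lowercase hexadecimal characters.
_FULL_SHA1 = re.compile(r"[0-9a-f]{40}")


def is_full_commit_hash(value):
    """Return True if value looks like a full (unabbreviated) commit hash."""
    return _FULL_SHA1.fullmatch(value) is not None


def invalid_pins(pins):
    """Return the sorted names of pins whose values are not full commit hashes."""
    return sorted(name for name, value in pins.items()
                  if not is_full_commit_hash(value))
```

An abbreviated id such as the release commit `8830aff` would be rejected by this check, which is intentional: the check only confirms the shape of a pin, not that the commit exists upstream.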

TensorFlow build

Added

Changed

  • Updates TensorFlow hash to 5d46b65af45d5694cb1676bc872d24a4a64a6b57, from nightly, Nov 25th

Note

Running tensorflow-aarch64/examples/run_unit_tests.sh with ONEDNN_DEFAULT_FPMATH_MODE=BF16 can lead to some failing tests. Addressing these failures is a WIP. However, all testing is expected to pass with the default setting ONEDNN_DEFAULT_FPMATH_MODE=FP32.
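
To compare the two settings mentioned in the note, the unit test script can be launched with the variable set explicitly. The following is a minimal sketch, assuming the script is run with `bash` from the directory that contains `tensorflow-aarch64`. The helper names `fpmath_env` and `run_unit_tests` are hypothetical, and only the two values named in the note are accepted.

```python
import os
import subprocess

# Script path as given in the note; assumed relative to the current directory.
UNIT_TEST_SCRIPT = "tensorflow-aarch64/examples/run_unit_tests.sh"


def fpmath_env(mode="FP32", base_env=None):
    """Return a copy of the environment with ONEDNN_DEFAULT_FPMATH_MODE set."""
    if mode not in ("FP32", "BF16"):
        raise ValueError(f"unsupported mode for this sketch: {mode}")
    env = dict(os.environ if base_env is None else base_env)
    env["ONEDNN_DEFAULT_FPMATH_MODE"] = mode
    return env


def run_unit_tests(mode="FP32"):
    """Run the unit test script under the given mode and return its exit code."""
    result = subprocess.run(["bash", UNIT_TEST_SCRIPT], env=fpmath_env(mode))
    return result.returncode
```

Calling `run_unit_tests("BF16")` and `run_unit_tests("FP32")` in turn gives a quick way to see which failures are specific to the BF16 setting.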

What's Changed

Full diff: r25.11...r25.12

Tool Solutions: ML Frameworks r25.11

20 Nov 10:26
98012a0

PyTorch build

  • The pytorch-aarch64 r25.11 README.md is available here.
  • A Docker image is available on Docker Hub under armlimited/pytorch-arm-neoverse.
  • Attached as artefacts are:
    • A full Software Bill of Materials (SBOM) as Tool-Solutions_r25.11_SPDX_SBOM.json; and
    • License texts as Tool-Solutions_r25.11_licenses.zip, which cover all packages included in the built image, in addition to the base Ubuntu image.

Added

  • Adds PyTorch PR #167720, to allow missing cutlass file if CUDA disabled.
  • Adds oneDNN PR #4237, which improves cache blocking 4/8 inner blocks.
  • Adds PyTorch PR #167328 Build cpuinfo into c10 shared library.
  • Adds PyTorch PR #144992 Enable fp16 linear layers in PyTorch via ACL.

Changed

  • Now building ACL with PyTorch install_acl.sh script.
  • Updates PyTorch manylinux AArch64 builder image from cpu-aarch64-a040006da76a51c4f660331e9abd3affe5a4bd81 to cpu-aarch64-d8be0384e085f551506bd739678109fa0f5ee7ac.
  • Updates hashes for:
    • PYTORCH_HASH to 5ce4a8b49f9986b050a9f6fcc7dd4cf999baa509, 2.10.0.dev20251112 from viable/strict, Nov 12th.
    • IDEEP_HASH to 927570638b237b0e39fb0626a868adffdbf70bbb, from ideep_pytorch, October 20th.
    • ONEDNN_HASH to 80886d0559482dfe2019c2ae83eebd6d0d3a17d4, from main, Nov 9th.
    • TORCH_AO_HASH to 17867e6788e4889b294449770f0275045384eab2, from main, Nov 8th.
    • KLEIDIAI_HASH to 7bf4de9a56106f0fb0d57dfabeb4c7a2668deaf6, v1.16.0 from main, Nov 10th.
  • Replaced ACL_HASH=007264fa740de5723ebddef16b7bb3657692c088 with ACL_VERSION=v52.6.0, from main, October 31st.
  • Update torchvision from 0.25.0.dev20250923 to 0.25.0.dev20251104.

Removed

  • Removes patches which are no longer required.

TensorFlow build

Added

  • Adds TensorFlow PR #102272, a patch fixing CPUIDInfo initialization and re-enabling the oneDNN+ACL path.

Changed

  • Updates TensorFlow hash to f40a10e064561ed66c03359c7751ccc5ae03dc4d, from nightly, Nov 10th.

Note

Running tensorflow-aarch64/examples/run_unit_tests.sh with ONEDNN_DEFAULT_FPMATH_MODE=BF16 can lead to some failing tests. Addressing these failures is a WIP. However, all testing is expected to pass with the default setting ONEDNN_DEFAULT_FPMATH_MODE=FP32.

What's Changed

New Contributors

Full Changelog: r25.10...r25.11

Tool Solutions: ML Frameworks r25.10

15 Oct 22:58
f15faf1

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.09/ML-Frameworks/pytorch-aarch64/README.md
Docker image available from: https://hub.docker.com/r/armlimited/pytorch-arm-neoverse
A full Software Bill of Materials (Tool-Solutions_r25.10_SPDX_SBOM.json) and license texts (Tool-Solutions_r25.10_licenses.zip), covering all packages added to the base Ubuntu 22.04 image, are attached as artefacts.

Added

  • Adds PyTorch PR to enable mimalloc by default on AArch64 build:
    • Yields ~28% and ~20% mean improvement for fp32 and bf16 respectively.

Removed

  • Completely removes unnecessary python packages, including setuptools, from PyTorch Docker image.
  • Delete unused submodules of PyTorch's third-party modules.
  • Removed all references to tcmalloc as the default build now uses mimalloc.

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.09/ML-Frameworks/tensorflow-aarch64/README.md

Removed

  • Removes protobuf packages that are not required for running TensorFlow from the Docker image, as a mitigation for CVE-2025-4565.
  • Removes system level python packages that aren't used.

Fixed

Full list of PRs

Full diff

r25.09...r25.10

Tool Solutions: ML Frameworks r25.09

30 Sep 10:04
1fc6820

What's Changed

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.09/ML-Frameworks/pytorch-aarch64/README.md

Changed

  • Updates ACL URL from ML Platform to GitHub.
  • Updates hashes for:
    • PYTORCH_HASH to 45d9dcccc5bfb46a9eaca037270278bc3d7c95ea, 2.10.0.dev20250923 from viable/strict, September 23rd.
    • IDEEP_HASH to fd11055f4800ac89291e30b5387a79a1e6496aa6, from ideep_pytorch, September 10th.
    • ONEDNN_HASH to 9e8f619477469ed75d323d4915bf7a2513f01713, from main, September 23rd.
    • ACL_HASH to 531a4968cecb7b4fc0a3b65482e2c524289e087e, from main, September 23rd.
    • TORCH_AO_HASH to 8e2ca35ea603349e71c2467e10fd371e34bf52bc, from main, September 23rd.
    • KLEIDIAI_HASH to bd2e6ae060014035e25bf4986be682762c446c2d, v1.14 from main.
  • Update torchvision from 0.23.0 to a nightly build, 0.25.0.dev20250923
  • Change of flag name in ./build.sh from --force to --fresh
  • Add intx_packing_format="opaque_aten_kleidiai" to Int8DynamicActivationIntxWeightConfig due to torchao API change
  • Updates dockerize.sh build to use BuildKit (to add extra build contexts).

Removed

  • Removes WIP ComputeLibrary patch https://review.mlplatform.org/c/ml/ComputeLibrary/+/12818/1.
  • Removed unused dependencies:
    • Delete unused submodules to avoid fetching.
    • Replace google-perftool bundle with specific libtcmalloc-minimal4 and add symlink so that you can still use it with LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4.
    • Remove unused git apt package.
    • Remove outdated versions of pip/setuptools/wheel from image after pip updates itself.
    • Remove manual install of transformers in transformers_llm_text_gen.py, we now use a single version everywhere.
    • Remove unused pip packages: datasets, omegaconf, psutil, tiktoken, torchaudio, torchdata and torchtune.
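
As an illustration of the `LD_PRELOAD` usage mentioned in the list above, the following minimal sketch launches a Python script with the tcmalloc library preloaded. The library path is the one given above. The helper names `preload_env` and `run_with_tcmalloc` are hypothetical, and the sketch assumes it runs inside an image where the symlink exists.

```python
import os
import subprocess
import sys

# Symlink path quoted in the release notes above.
TCMALLOC_PATH = "/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4"


def preload_env(library=TCMALLOC_PATH, base_env=None):
    """Return a copy of the environment with library placed first in LD_PRELOAD."""
    env = dict(os.environ if base_env is None else base_env)
    existing = env.get("LD_PRELOAD", "")
    # LD_PRELOAD entries may be separated by colons; keep any existing entries.
    env["LD_PRELOAD"] = f"{library}:{existing}" if existing else library
    return env


def run_with_tcmalloc(script, *args):
    """Run a Python script with tcmalloc preloaded and return its exit code."""
    if not os.path.exists(TCMALLOC_PATH):
        raise FileNotFoundError(TCMALLOC_PATH)
    command = [sys.executable, script, *args]
    return subprocess.run(command, env=preload_env()).returncode
```

The environment is built as a copy, so the preload applies only to the child process and leaves the calling process unchanged.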

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.09/ML-Frameworks/tensorflow-aarch64/README.md

Added

  • Adds WIP patch to upgrade Compute Library to 52.4.0
  • Adds SECURITY.md to built image

Changed

  • Updates TensorFlow hash to 6aa8fd07270293b918255af5988aef45b844c5b4, from nightly, September 22nd
  • Updates dockerize.sh build to use BuildKit (to add extra build contexts)
  • Change of flag name in ./build.sh from --force to --fresh

Removed

  • Removed previous patch updating oneDNN and Compute Library that has been merged upstream

Full list of PRs

New Contributors

Full diff

r25.08...r25.09

Tool Solutions: ML Frameworks r25.08

29 Aug 09:36
95a3864

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.08/ML-Frameworks/pytorch-aarch64/README.md

Added

  • Adds pytorch/pytorch#159859, a WIP LUT implementation of bf16 GELU;
    this gives an ~8x speedup on GELU and an ~1.8x speedup for attention for llama3.2 11B Vision (both on 16 threads).
  • Adds pytorch/pytorch#158250, to integrate INT4->BF16 via KleidiAI, with fallback.
  • Adds pytorch/pytorch#160080, a VLA PoC for PyTorch, and
    pytorch/pytorch#161049, an optimised SVE exp_u20 implementation.
    Note: there may be some regressions on Neoverse-V1 with this WIP patch.
  • Adds a new example script llama_vision_instruct.py to run and benchmark
    Llama-3.2-11B-Vision-Instruct using text + image input and text output.
     - Llama-3.2-11B-Vision-Instruct gives a TTFT of < 8s and a decode throughput of > 16 tokens/s; the base model gives a TTFT of < 3.5s and a throughput of > 20 tokens/s.
  • Temporarily removed --compile flag from transformers_llm_text_gen.py example as there is an unresolved issue with torch compile support in this case.
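
The TTFT and decode throughput figures quoted for llama_vision_instruct.py can be related to per-token timestamps as in the sketch below. It assumes TTFT is the time from the start of the request to the first generated token, and that decode throughput counts only the tokens produced after the first one. The example script may measure these differently, and the function name `generation_metrics` is hypothetical.

```python
def generation_metrics(start_time, token_times):
    """Return (ttft_seconds, decode_tokens_per_second) from timestamps.

    start_time  -- time at which generation was requested, in seconds
    token_times -- time at which each generated token arrived, in order
    """
    if not token_times:
        raise ValueError("at least one generated token is required")
    # Time to first token: delay before the first token appears.
    ttft = token_times[0] - start_time
    # Decode phase: every token after the first, over the time they took.
    decode_tokens = len(token_times) - 1
    decode_seconds = token_times[-1] - token_times[0]
    if decode_seconds > 0:
        throughput = decode_tokens / decode_seconds
    else:
        throughput = float("nan")  # undefined for a single token
    return ttft, throughput
```

For example, a first token arriving 2.0 s after the request, followed by four more tokens over the next 2.0 s, gives a TTFT of 2.0 s and a decode throughput of 2.0 tokens/s.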

Changed

  • Updates hashes for:
    • PYTORCH_HASH to 4e2ddb5db67617f9f5309c8bba0c17adc84cadbc, 2.9.0.dev20250808 from viable/strict, August 8th.
    • IDEEP_HASH to 3527b0bf2127aa2de93810feb6906d173c24037f, from ideep_pytorch, August 1st.
    • ONEDNN_HASH to 7e85b94b5f6be27b83c5435603ab67888b99da32, from main, August 1st.
    • ACL_HASH to 3c32d706d0245dcb55181c8ced526eab05e2ff8d, from main, August 1st.
    • TORCH_AO_HASH to 8d4a5d83d7be4d7807feabe38d37704c92d40900, from main, August 1st.
    • KLEIDIAI_HASH to 8ca226712975f24f13f71d04cda039a0ee9f9e2f, v1.12 from main.
  • Update the examples/transformers_llm_text_gen.py to use the new quantizer api Int8DynamicActivationIntxWeightConfig.
  • Deleted torchchat_llm_text_gen.py.
  • Removed Dockerfile lines cloning TorchChat repo and setting safe.directory.
  • Updates huggingface_hub to 0.34.0.

Removed

Fixed

  • Updates various Python packages to address known vulnerabilities with a high CVSS score:
    • Updates Transformers to 4.55.2, which also provides a mitigation for CVE-2025-2099.
    • Updates Wheel version to 0.38.0 as a mitigation for CVE-2022-40898.
    • Updates setuptools to 78.1.1 as a mitigation for CVE-2025-47273 and CVE-2024-6345.
    • Updates Torchvision to 0.23 to avoid the need to use --extra-index-url;
      this is the recommended mitigation against CVE-2018-20225, affecting all versions of pip.
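
A minimal sketch for checking whether an existing environment already meets the minimum versions listed above. It assumes the PyPI distribution names `transformers`, `wheel` and `setuptools`, and uses a deliberately simple parser that compares only the leading dotted-numeric part of a version string, so pre-release suffixes are ignored. The names `MINIMUM_VERSIONS`, `version_tuple`, `meets_minimum` and `outdated_packages` are illustrative.

```python
from importlib import metadata

# Minimum versions taken from the "Fixed" list above.
MINIMUM_VERSIONS = {
    "transformers": "4.55.2",
    "wheel": "0.38.0",
    "setuptools": "78.1.1",
}


def version_tuple(version):
    """Parse the leading dotted-numeric part of a version, e.g. '2.9.0.dev1' -> (2, 9, 0)."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first non-numeric component
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Return True if installed is at least minimum (numeric parts only)."""
    a, b = version_tuple(installed), version_tuple(minimum)
    width = max(len(a), len(b))
    # Pad with zeros so that '0.38' and '0.38.0' compare as equal.
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def outdated_packages(minimums=MINIMUM_VERSIONS):
    """Return {name: installed_version} for installed packages below their minimum."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            continue  # not installed, so there is nothing to compare
        if not meets_minimum(installed, minimum):
            report[name] = installed
    return report
```

An empty result from `outdated_packages()` means that none of the installed packages in the table is below its listed minimum. Packages that are not installed are skipped rather than reported.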

Known issues

  • transformers_llm_text_gen.py example does not currently work in compiler mode.
  • Sample image downloads from https://upload.wikimedia.org may encounter a 403 response. The sample image can be downloaded separately via curl if needed.

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.08/ML-Frameworks/tensorflow-aarch64/README.md

Changed

  • Updates TensorFlow hash to ab8aab720f1648f6a470b159b0d1aea3a5b0df81, 2.20.0-dev0 from master, 25th July 2025

Fixed

  • Updates Transformers to 4.50 as a mitigation for CVE-2025-2099.

Full list of PRs

  • Add smoke tests to precommit ci by @Ryo-not-rio in #341
  • build: update TF hashes by @Sqvid in #346
  • fix: correct typo in stderr redirect by @Sqvid in #347
  • [Update] Update new quantizer API usage, bump TorchAO version, and remove torchchat. by @gausah-arm in #348
  • Bumps hashes for PyTorch build ahead of 25.08 release. by @nSircombe in #349
  • Add unit tests to ci by @Ryo-not-rio in #345
  • pytorch: re-pin and reduce build pip requirements by @jondea in #351
  • pytorch: fix dockerize and deduplicate build logic by @jondea in #352
  • onednn - add brgemm bf16 patch by @aditew01 in #354
  • Adds PyTorch patch to implement bf16 GELU via LUT. by @nSircombe in #353
  • r25.08 updates by @nSircombe in #355
  • Updates requirements to add setuptools back by @nSircombe in #357
  • Minor corrections to CHANGELOG and examples README by @nSircombe in #358
  • Updates to CHANGELOG and docs. by @nSircombe in #359

Full diff: r25.07...r25.08

Tool Solutions: ML Frameworks r25.07

14 Jul 14:36
df2eceb

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.07/ML-Frameworks/pytorch-aarch64/README.md

Changed

  • Updates Dockerfile to minimise layers in the final image.
  • Updates hashes for:
    • PyTorch to 5dfd8a9c7a464bb42e81b8594eefd2fa865e5423, 2.9.0.dev20250703 from viable/strict
    • ideep to 6eb12eaad5e0f7d8c8613c744ac8ba5a0843cb99, from ideep_pytorch, July 3rd
    • oneDNN to 0abfca1947b53c03ee74207e4710941ab6456f3b, from main, July 3rd
    • Compute Library to f69b48afcc59f1b3b0d4544289249bebba489f0a, from main, June 26th

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.07/ML-Frameworks/tensorflow-aarch64/README.md

Changed

  • Updates Dockerfile to minimise layers in the final image.
  • Updates TensorFlow hash to 70d137e8794f97728e094608ec9ea1c6c9fd42af, 2.20.0-dev0 from nightly, July 1st

Full list of PRs

Full Diff: r25.06...r25.07

Tool Solutions: ML Frameworks r25.06

23 Jun 08:41
8f49d77

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.06/ML-Frameworks/pytorch-aarch64/README.md

Added

  • Adds WIP patch to update the OpenBLAS commit - pytorch/pytorch#151547
    • Speedups across most of the HF models via significant boost to SDPA layers.
    • Overall torch-bench pass-rate increased
  • Adds support in github-apply-patch for local caching of patches.

Changed

  • Updates protobuf from 5.29.2 to 5.29.5 in response to CVE-2025-4565
  • Updates hashes for:
    • PyTorch to 3040ca6d0f8558e39919b14eebeacc34ddf980f5 2.8.0.dev20250611 from viable/strict
    • ideep to 2ef932a861439e4cc9bb8baee8424b57573de023 from ideep_pytorch, June 10.
    • oneDNN to 106a7b41bc4156297b8a88cd1951304b739cc427 from main, June 10th
    • Compute Library to 6bc1c7b8d0756272e2a97a7489e13de90f864326 from main, June 9th

Removed

  • Removes WIP patches which have now landed in the upstream nightly PyTorch builds.

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.06/ML-Frameworks/tensorflow-aarch64/README.md

Added

  • Adds support in github-apply-patch for local caching of patches.
  • Adds WIP patch to tensorflow/tensorflow#93951
    to update to oneDNN-3.7 + ACL-24.12, and address unit test failures

Changed

  • Updates TensorFlow hash to 65781570c55d2338106767de200323f123c3f91f
    tensorflow-2.20.0.dev0 from main

Complete List of PRs

  • Bump commits and update patches for 25.06 by @murste01 in #332
  • Bump protobuf from 5.29.2 to 5.29.5 in /ML-Frameworks/pytorch-aarch64 by @dependabot in #333
  • Re-add patch which pins PyTorch requirements by @jondea in #334
  • Add Update OpenBLAS Commit patch to PyTorch by @murste01 in #336
  • Add patch cache to github-apply-patch by @jondea in #335
  • Update CHANGELOGs for 25.06 release. by @nSircombe in #337

New Contributors

Full Diff: r25.05...r25.06

Tool Solutions: ML Frameworks r25.05

16 May 11:50
58b44ac

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.05/ML-Frameworks/pytorch-aarch64/README.md

Added

  • WIP patch to build a newer version of libgomp from source - pytorch/pytorch#152361.
    Improves scaling for >16 threads.
  • Support for authenticated GitHub access in apply-github-patch

Changed

  • Updates hashes for:
    • PyTorch to e872bf8f888bdbb27a03e03935db61babf7180b8, 2.8.0.dev20250430 from viable/strict
    • ideep to 2ef932a861439e4cc9bb8baee8424b57573de023 from ideep_pytorch
    • oneDNN to 69150ce5fe1f453af9125ca42a921e017092ccf7 from main
    • Compute Library to 334108c0efc512efdc9576ba957dbcf5b7ee168a, rc_25_04_29_0
  • Updates existing WIP patches.

Removed

  • Removes WIP patches which have now landed in the upstream nightly PyTorch builds.

Fixed

  • Reverted this commit to add back --compile option. The issue was fixed upstream and is included in the bumped hash

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.05/ML-Frameworks/tensorflow-aarch64/README.md

Added

  • Support for authenticated GitHub access in apply-github-patch

Complete List of PRs

New Contributors

Full Changelog: r25.04...r25.05

Tool Solutions: ML Frameworks r25.04

22 Apr 17:32
95928c1

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.04/ML-Frameworks/pytorch-aarch64/README.md

Added

  • Work in progress oneDNN patch, Enable jit conv for 128, with ~30% speedup for backward convolutions
  • Add --wheel-only flag for only building the torch wheel
  • Work in progress PyTorch patch for pinning the major versions of build dependencies - pytorch/pytorch#150833

Changed

  • Updates hashes for:
    • PyTorch to fc674b45d4d8edfd4c630d89f71ea9f85a2f61f2, 2.8.0.dev20250403 from viable/strict
    • ideep to 719d8e6cd7f7a0e01b155657526d693acf97c2b3 from ideep_pytorch
    • oneDNN to 5de25f354afee38bf2db61f485c729d30f62c611 from main
    • Compute Library to 9033bdacdc3840c80762bc56e8facb87b0e1048e, 25.03 release
    • OpenBLAS to edef2e4441e50e3a2da1920fdbde09101087c43d from main
  • Updates existing WIP patches.
  • Update torchvision from 0.22.0.dev20250305 to a more recent nightly build, 0.22.0.dev20250403

Removed

  • Removes WIP patches which have now landed in the upstream nightly PyTorch builds.
  • Removes --tags --force from git clone command, and adds --depth=1 to speedup the checkout.
  • Temporarily removes --compile option from some examples due to an issue with pytorch/pytorch#147151;
    the compile path does not work as expected in these cases.

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.04/ML-Frameworks/tensorflow-aarch64/README.md

Added

Changed

Complete list of PRs

  • fix: typo in stdout to stderr redirection by @Sqvid in #312
  • Bump hashes ahead of r25.04 release. by @nSircombe in #313
  • Adds back BF16 SVE intrinsics via new patch by @nSircombe in #314
  • simplify apply-github-patch to implicitly use GitHub URL by @Sqvid in #315
  • Update torchvision version and PyTorch hash to fix vision test failure. by @nSircombe in #316
  • add common dirs and files to gitignore by @Sqvid in #317
  • Add oneDNN sve_128 jit conv patch for PyTorch by @jondea in #319
  • build TensorFlow against + oneDNN 3.7 + ACL 24.12 by @Sqvid in #318
  • More dev improvements by @jondea in #320
  • Pin major versions of PyTorch build deps by @jondea in #321
  • update pytorch src's by @aditew01 in #322
  • Update examples and CHANGELOG for r25.04 by @nSircombe in #323

New Contributors

Full Diff: r25.03.1...r25.04

Tool Solutions: ML Frameworks r25.03.1

27 Mar 08:24
a8998a9

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.03.1/ML-Frameworks/pytorch-aarch64/README.md

Changed

  • Move torch* packages to latest stable to fix build error
    • torchaudio==2.6.0.dev20250305 -> 2.6.0
    • torchdata~=0.7.1 -> 0.11.0
    • torchvision~=0.22.0.dev20250305 -> 0.21.0
    • torchtune==0.6.0.dev20250115 -> 0.5.0

Fixed

  • Fix build error due to missing torchtune nightly build
  • Fix build error due to auditwheel version change by applying commits from pytorch/pytorch#149471

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.03.1/ML-Frameworks/tensorflow-aarch64/README.md
No changes to the TensorFlow build with this release.

Complete list of PRs

  • Fix missing torch* nightlies by using latest stable by @jondea in #307
  • Fix build error due to auditwheel version change by @jondea in #308
  • Update CHANGELOGs for patch release. by @nSircombe in #309
  • Add missing deps for torchchat example by @jondea in #310
  • Update CHANGELOGs for patch release. by @nSircombe in #311

Full Diff: r25.03...r25.03.1