Skip to content

Releases: NVIDIA-NeMo/Run

NVIDIA NeMo Run 0.8.0

26 Feb 02:21
8c55dcc

Choose a tag to compare

Executors

  • fix(typo): exit_code prints empty #379
  • fix: limit docker hostname to 32 characters #378
  • add secrets to lepton #383
  • Add RayCluster support for DGX Cloud Lepton #389
  • Fix AssertionError: no app_id collisions expected when scheduling JobGroups locally #404
  • feat: add het-job support for ray slurm #407
  • feat: use slurm executor to get ray template name #410

Ray Integration

  • Update ray template #375
  • fix ray templates by using --exclusive to launch ray nodes #380
  • Revert "fix ray templates by using --exclusive to launch ray nodes (#380) #384
  • Add RayCluster support for DGX Cloud Lepton #389
  • Update ray_enroot template #406
  • feat: add het-job support for ray slurm #407
  • feat: use slurm executor to get ray template name #410

Experiment & Job Management

  • Add RayCluster support for DGX Cloud Lepton #389
  • Fix AssertionError: no app_id collisions expected when scheduling JobGroups locally #404
  • feat: add het-job support for ray slurm #407

Documentation

  • fix: limit docker hostname to 32 characters #378
  • fix: Update README.md #388
  • fix broken links in README.md #386
  • docs: Fix broken links in README and CONTRIBUTING #390
  • Add RayCluster support for DGX Cloud Lepton #389
  • docs: Release docs #412
  • cp: ci: Update release-docs workflow to use FW-CI-templates v0.72.0 (423) into r0.8.0 #424
  • cp: ci: Update release workflow to include changelog and docs (426) into r0.8.0 #427
  • docs: Update docs for 0.8.0 #428
  • docs: Update docs to include nightly and use latest #431

CI/CD

  • Update ray template #375
  • Update changelog for r0.7.0 #396
  • cp: ci: Update release-docs workflow to use FW-CI-templates v0.72.0 (423) into r0.8.0 #424
  • cp: ci: Update release workflow to include changelog and docs (426) into r0.8.0 #427

Bug Fixes

  • fix host #373
  • fix ray templates by using --exclusive to launch ray nodes #380
  • fix(typo): exit_code prints empty #379
  • fix: limit docker hostname to 32 characters #378
  • fix: Update README.md #388
  • fix broken links in README.md #386
  • Revert "fix ray templates by using --exclusive to launch ray nodes (#380) #384
  • docs: Fix broken links in README and CONTRIBUTING #390
  • fix: Retry polling token #392
  • fix: DGXC streaming #401
  • Fix AssertionError: no app_id collisions expected when scheduling JobGroups locally #404
  • fix: remove unexpected side effect in get_srun_flags #408
  • fix: Search for incluster config if no kubeconfig is given #411
  • fix: Pass DGXC to ft_launcher #402
  • cp: Fix uv sync error (#422) into r0.8.0 #425

Others

  • Version bump to 0.8.0rc0.dev0 #368
  • feat: add copyright check #369
  • feat: copyright check #370
  • Add port parameter to SSHTunnel #372
  • update copyright check version #376
  • feat: Stream DGXC logs #377
  • feat: Stream logs to disk #393
  • Update nvidia-sphinx-theme #398

NVIDIA NeMo Run 0.7.0

03 Dec 23:54
bbdea4c

Choose a tag to compare

NVIDIA Nemo Run 0.7.0

Detailed Changelogs:

Executors

  • Add image pull secrets param for lepton #330
  • Add node reservations for LeptonExecutor #336
  • [SkyPilot] Fix nodes -> num_nodes for SkyPilotExecutor in docs #338
  • [SkyPilot] Add retry_until_up as an optional arg to SkyPilot Executor #340
  • Support SkyPilot Storage configurations in file_mounts for automatic cloud sync #335
  • [SkyPilot] Update YAML dump imports + backward compatibility for SkyPilot <=0.10.3 #339
  • Create SkypilotJobsExecutor to allow running managed jobs #343
  • fix: exit code docker runs #365

Ray Integration

  • Add ray head start timeout #324
  • Remove ray deprecated dashboard-grpc-port arg #325

Experiment & Job Management

  • add a grace for Jobs that may start in Unknown #291
  • Create SkypilotJobsExecutor to allow running managed jobs #343

Packaging & Deployment

  • Support SkyPilot Storage configurations in file_mounts for automatic cloud sync #335
  • Refactor tar packaging logic to work for submodule and extra repo #347

Documentation

  • Add broken links check in docs #333
  • [SkyPilot] Fix nodes -> num_nodes for SkyPilotExecutor in docs #338
  • Documentation Restructurting #350
  • Fix spelling in docstring #359
  • fix: exit code docker runs #365

CI/CD

  • Update cherry-pick workflow to use version 0.63.0 #344
  • fix: exit code docker runs #365

Bug Fixes

  • [SkyPilot] Fix nodes -> num_nodes for SkyPilotExecutor in docs #338
  • Fix spelling in docstring #359
  • fix: exit code docker runs #365

Others

  • chore: Bump to version 0.7.0rc0.dev0 #322
  • Update community-bot to add community issues to shared project #321
  • Bump community-bot to 0.54.4 #332
  • remove custom dir #351
  • Bumping to 0.5.0 #352
  • Update release notes header in changelog build #355
  • add changelog-config #356
  • Changelog 0.6.0 #357
  • feat: new changelog-build #367

NVIDIA NeMo Run 0.7.0rc0.dev0

03 Dec 23:25
dc86aea

Choose a tag to compare

Pre-release

Prerelease: NVIDIA NeMo Run 0.7.0rc0.dev0 (2025-12-03)

NVIDIA NeMo Run 0.6.0

09 Oct 16:13
030f862

Choose a tag to compare

NVIDIA Nemo Run 0.6.0

Detailed Changelogs:

Executors

  • Added Pre-Launch Commands Support to LeptonExecutor #312
  • Remove breaking torchrun config for single-node runs #292
  • Upgrade skypilot to v0.10.0, introduce network_tier #297
  • Fixes for multi-node execution with torchrun + LocalExecutor #251
  • Add option to specify --container-env for srun #293
  • Fix skypilot archive mount bug #288
  • finetune on dgxcloud with nemo-run and deploy on bedrock example #286

Ray Integration

  • Add nsys patch in ray sub template #318
  • Add logs dir to container mount for ray slurm #287
  • Allow customizing folder for SlurmRayRequest #281

CLI & Configuration

Experiment & Job Management

  • Use thread pool for status, run methods inside experiment + other fixes #295

Packaging & Deployment

  • Correctly append tar files for packaging #317

Documentation

  • Create CHANGELOG.md #314
  • docs: Fixing doc build issue #290
  • fix docs tutorial links and add intro to guides/index.md #285
  • README #277

CI/CD

  • changelog workflow #315
  • Update release.yml #306
  • ci(fix): Use GITHUB_TOKEN for community bot #302
  • ci: Add community-bot #300

Bug Fixes

  • [Bugfix] Adding a check for name length #273
  • misc fixes #280
  • adding fix for lowercase and name length k8s requirements #274

Others

  • Specify nodes for gpu metrics collection and split data to each rank #320
  • Apply '_enable_goodbye_message' check to both goodbye messages. #319
  • Update refs #278
  • chore: Bump to version 0.6.0rc0.dev0 #272

NVIDIA NeMo Run 0.6.0rc0.dev0

09 Oct 05:53
d01f76a

Choose a tag to compare

Pre-release

Prerelease: NVIDIA NeMo Run 0.6.0rc0.dev0 (2025-10-09)

NVIDIA NeMo Run 0.5.0

04 Aug 21:10
b234cfd

Choose a tag to compare

Features and improvements

NVIDIA NeMo Run 0.4.0

09 May 00:58
33458c8

Choose a tag to compare

Features and improvements.

NVIDIA NeMo Run 0.4.0rc2.dev0

07 Apr 22:39
0ba2f18

Choose a tag to compare

Pre-release

Prerelease: NVIDIA NeMo Run 0.4.0rc2.dev0 (2025-04-07)

NVIDIA NeMo-Run 0.3.0

14 Mar 00:13
v0.3.0
451f781

Choose a tag to compare

v0.3.0

Tag v0.3.0