Enable sharding for Zarr files for external aerodynamics pipeline by saikrishnanc-nv · Pull Request #41 · NVIDIA/physicsnemo-curator

saikrishnanc-nv · 2025-12-02T23:44:56Z

This PR enables sharding, for Zarr files produced by the external aerodynamics pipeline.
This follows Zarr docs, and roughly creates ~1 GB shards, each of which contain ~1000 chunks each of which are ~1 MB in size.
This is being done to reduce number of files for large files (volume files for example), while maintaining fast random access (because of chunking).

Tests are also being added.

saikrishnanc-nv · 2025-12-02T23:45:22Z

/blossom-ci

coreyjadams

Looks good to me! I have an IO benchmarking script for DrivaerML, we could do some performance tests on the sharding configuration if you want. But it's impossible to hit it perfectly in all cases. Since chunk size and chunks per shard is configurable, these are good defaults.

saikrishnanc-nv added 4 commits December 2, 2025 15:41

Enable sharding for large arrays

2413c4f

Fix incorrect config update

6e63521

Simplified sharding logic

7f09fcc

Fix sharding divisiblity bug

23e949e

saikrishnanc-nv requested a review from coreyjadams December 2, 2025 23:45

saikrishnanc-nv self-assigned this Dec 2, 2025

coreyjadams approved these changes Dec 3, 2025

View reviewed changes

saikrishnanc-nv merged commit 8465c46 into NVIDIA:main Dec 3, 2025
1 check passed

saikrishnanc-nv deleted the saikrishnanc/sharding branch December 3, 2025 21:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable sharding for Zarr files for external aerodynamics pipeline#41

Enable sharding for Zarr files for external aerodynamics pipeline#41
saikrishnanc-nv merged 4 commits intoNVIDIA:mainfrom
saikrishnanc-nv:saikrishnanc/sharding

saikrishnanc-nv commented Dec 2, 2025

Uh oh!

saikrishnanc-nv commented Dec 2, 2025

Uh oh!

coreyjadams left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

saikrishnanc-nv commented Dec 2, 2025

Uh oh!

saikrishnanc-nv commented Dec 2, 2025

Uh oh!

coreyjadams left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants