Skip to content

Comments

Fix/megatron/megatron fsdp patches#553

Merged
wenxie-amd merged 1 commit intomainfrom
fix/megatron/megatron-fsdp-patches
Feb 25, 2026
Merged

Fix/megatron/megatron fsdp patches#553
wenxie-amd merged 1 commit intomainfrom
fix/megatron/megatron-fsdp-patches

Conversation

@jasainio
Copy link
Contributor

Adds a patch to fix Megatron FSDP compatibility with PyTorch 2.10+. The patch updates get_mesh_names to use the new DeviceMesh API (_get_root_mesh() and _flatten_mapping) instead of the deprecated _mesh_resources.child_to_root_mapping removed in PyTorch 2.10. The patch is automatically applied when use_megatron_fsdp is enabled.

@jasainio jasainio force-pushed the fix/megatron/megatron-fsdp-patches branch 3 times, most recently from 00239a9 to 8002c01 Compare February 10, 2026 13:20
@jasainio jasainio force-pushed the fix/megatron/megatron-fsdp-patches branch from 8002c01 to 1a4685c Compare February 10, 2026 13:36
@wenxie-amd wenxie-amd merged commit fc3f6b2 into main Feb 25, 2026
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants