Conversation
Code Review
This pull request introduces support for context parallelism (CP) in the GatedDeltaNet module by adjusting sequence-length calculations and implementing all-to-all communication between the CP and head-parallelism domains. The changes also include CP-aware parameter fetching for convolutions and gate calculations. Review feedback identified several critical issues: a missing import for tensor_a2a_hp2cp, a NameError caused by referencing A_log without the self. prefix, and a potentially incorrect groups parameter in the manual F.conv1d call.
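The NameError flagged above is a common attribute-scoping bug. A minimal sketch of the fix, using hypothetical names modeled on the review (this is not the actual GatedDeltaNet code):

```python
import torch
import torch.nn as nn


class GatedDeltaNetSketch(nn.Module):
    """Hypothetical minimal module; only A_log is taken from the review."""

    def __init__(self, num_heads: int = 4):
        super().__init__()
        # Per-head log-decay parameter, as in gated delta-rule layers.
        self.A_log = nn.Parameter(torch.zeros(num_heads))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Correct: qualify the attribute as self.A_log. A bare `A_log`
        # here would raise NameError at call time, which is the class
        # of bug the review points out.
        decay = -self.A_log.exp()
        return x * decay.mean()
```

With `A_log` initialized to zeros, `decay` is `-1` per head, so the forward pass simply negates the input in this toy setting.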
/gemini review
Code Review
This pull request implements context parallelism (CP) support within the GatedDeltaNet module, including sequence-length adjustments, parameter slicing for depthwise convolutions, and all-to-all communication for projections. The review identifies critical issues in the sequence-unpacking logic used during all-to-all operations, which is both inefficient and incorrect for packed sequences. Additionally, a bug was found in the F.conv1d call, where the groups parameter could be set to None, and removing the _unpack_sequence helper function was suggested to simplify the implementation.
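The groups=None bug noted above can be guarded against explicitly: for a depthwise convolution, F.conv1d needs one group per channel. A hedged sketch (the function name and causal-padding choice are assumptions, not the PR's implementation):

```python
import torch
import torch.nn.functional as F


def causal_depthwise_conv1d(x, weight, groups=None):
    """Depthwise causal convolution sketch.

    x: (batch, channels, seq_len); weight: (channels, 1, kernel_size).
    """
    if groups is None:
        # Guard against the None-groups bug from the review: F.conv1d
        # requires an integer, and depthwise means one group per channel.
        groups = x.shape[1]
    kernel_size = weight.shape[-1]
    # Left-pad by kernel_size - 1 and trim the tail so the conv stays causal.
    out = F.conv1d(x, weight, padding=kernel_size - 1, groups=groups)
    return out[..., : x.shape[-1]]
```

Resolving the group count from the input's channel dimension keeps the call valid even when the caller omits the parameter, which is the failure mode the review describes.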