Actions: NVIDIA/Megatron-LM
Actions
Showing runs from all workflows
79,095 workflow runs
79,095 workflow runs
--overlap-grad-reduce and --num-distributed-optimizer-instances > 1 due to autograd hook stream affinity
Community Bot
#6844:
Issue #3670
opened
by
zyeric