Skip to content

Optimize group_index_select_or_add_2d_kernel on ROCm by adding a separate codepath for small embedding dimensions#5233

Open
aryaman-gupta wants to merge 6 commits intopytorch:mainfrom
ROCm:aryaman/group-index-subwarp