Optimize group_index_select_or_add_2d_kernel on ROCm by adding a separate codepath for small embedding dimensions #5233
+96
−18
Meta CodeSync / Import Status
succeeded
Dec 18, 2025 in 4h 12m 34s
PR has been imported
@q10 has imported this pull request.
Loading