Skip to content

Conversation

@moven0831
Copy link
Collaborator

No description provided.

moven0831 and others added 28 commits May 29, 2025 16:26
Optimize Performance with Dynamic Thread and Threadgroup Config
…ve the threadgroup params fetching with system-value attributes
Fine-tune Window Size with Input Size
Refactor/msl level optimization
docs: add benchmark for performance comparisons
Summary of profiling results added.
@moven0831 moven0831 force-pushed the metal-msm-v2 branch 2 times, most recently from 5f26d97 to 5776b39 Compare July 11, 2025 15:43
@moven0831 moven0831 merged commit 1fc11b3 into main Jul 11, 2025
2 checks passed
@moven0831 moven0831 mentioned this pull request Jul 11, 2025
4 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants