-
Couldn't load subscription status.
- Fork 353
Update config.py to negate the dimension issue for FP8 support for AMD GPUs #3246
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Update the config.py for AMD GPUs FP8
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3246
Note: Links to docs will display an error until the docs builds have been completed. This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
Hi @kailashg26! Thank you for your pull request and welcome to our community. Action RequiredIn order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you. ProcessIn order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA. Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with If you have received this in error or have any questions, please contact us at cla@meta.com. Thanks! |
|
could you share how this is related to AMD GPUs? From the screenshot in the PR summary, it looks like the shapes being fed through the network do not adhere to the requirements of |
|
@vkuzo I agree, this is GPU independent problem. I just tried on MI300 and MI355 and this problem persists. But when I enable padding it works fine! |
|
got it, can we just enable padding at the callsite for your use case instead of changing the default? |
|
@vkuzo not sure how do we do that at the callsite. You mean before I run my script using something like this?
I was just wondering if this might be very hacky |
|
usually the user creates a config = Float8LinearConfig(...)in the place where that happens in torchtune, you could set the padding flag to True, or make that user configurable. Would that work? |
|
But if we use upstream torchtune we have to submit PR to upstream one right? Not sure if they are actively accepting PRs |
Update the config.py file to fix (“negate”) the dimension mismatch issue that arises when enabling FP8 (8-bit floating point) precision support on AMD GPUs.
Error:
