-
Notifications
You must be signed in to change notification settings - Fork 640
Make fp8 ck gemm support both OCP and FNUZ FP8 #4728
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
✅ Deploy Preview for pytorch-fbgemm-docs ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
This pull request was exported from Phabricator. Differential Revision: D80138910 |
19b0812
to
64fc78c
Compare
Summary: X-link: facebookresearch/FBGEMM#1751 Use type info from ck to tell whether we are using OCP FP8 or FNUZ FP8. Updated fp8_rowwise and fp8_rowwise_preshuffle wrappers. Note that header macros like CK_USE_OCP_FP8 and CK_USE_FNUZ_FP8 could be both defined (if need to support multiple architectures). Reviewed By: mxz297 Differential Revision: D80138910
This pull request was exported from Phabricator. Differential Revision: D80138910 |
64fc78c
to
b403940
Compare
Summary: X-link: facebookresearch/FBGEMM#1751 Use type info from ck to tell whether we are using OCP FP8 or FNUZ FP8. Updated fp8_rowwise and fp8_rowwise_preshuffle wrappers. Note that header macros like CK_USE_OCP_FP8 and CK_USE_FNUZ_FP8 could be both defined (if need to support multiple architectures). Reviewed By: mxz297 Differential Revision: D80138910
b403940
to
6ec6308
Compare
Summary: X-link: facebookresearch/FBGEMM#1751 Use type info from ck to tell whether we are using OCP FP8 or FNUZ FP8. Updated fp8_rowwise and fp8_rowwise_preshuffle wrappers. Note that header macros like CK_USE_OCP_FP8 and CK_USE_FNUZ_FP8 could be both defined (if need to support multiple architectures). Reviewed By: mxz297 Differential Revision: D80138910
This pull request was exported from Phabricator. Differential Revision: D80138910 |
Summary: Pull Request resolved: pytorch#4728 X-link: facebookresearch/FBGEMM#1751 Use type info from ck to tell whether we are using OCP FP8 or FNUZ FP8. Updated fp8_rowwise and fp8_rowwise_preshuffle wrappers. Note that header macros like CK_USE_OCP_FP8 and CK_USE_FNUZ_FP8 could be both defined (if need to support multiple architectures). Reviewed By: mxz297 Differential Revision: D80138910
6ec6308
to
5afcf65
Compare
Summary: Pull Request resolved: pytorch#4728 X-link: facebookresearch/FBGEMM#1751 Use type info from ck to tell whether we are using OCP FP8 or FNUZ FP8. Updated fp8_rowwise and fp8_rowwise_preshuffle wrappers. Note that header macros like CK_USE_OCP_FP8 and CK_USE_FNUZ_FP8 could be both defined (if need to support multiple architectures). Reviewed By: mxz297 Differential Revision: D80138910
This pull request was exported from Phabricator. Differential Revision: D80138910 |
5afcf65
to
43db625
Compare
Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/1751
Use type info from ck to tell whether we are using OCP FP8 or FNUZ FP8.
Updated fp8_rowwise and fp8_rowwise_preshuffle wrappers.
Note that header macros like CK_USE_OCP_FP8 and CK_USE_FNUZ_FP8 could be both defined (if need to support multiple architectures).
Reviewed By: mxz297
Differential Revision: D80138910