Skip to content

wqupdate xdnn and pack to support fp8 gemm in prefill #522

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jul 21, 2025

Conversation

abenmao
Copy link
Contributor

@abenmao abenmao commented Jul 3, 2025

No description provided.

@abenmao abenmao requested a review from a3213105 July 8, 2025 02:03
@abenmao abenmao force-pushed the perf/e4m3/pack branch 2 times, most recently from f576f44 to 05e32c8 Compare July 8, 2025 05:41
@abenmao abenmao force-pushed the perf/e4m3/pack branch 2 times, most recently from a9c7641 to b4a4193 Compare July 21, 2025 05:12
int amx_cols = (int)((N + 63) / 64) * 64;
if (!weight.isShadow()) weight.Resize(amx_rows, amx_cols);
memset(weight.Data(), 0, sizeof(e4m3_t) * amx_rows * amx_cols);
int blockSize = 32;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rename blockSize to packedBlockSize to avoid potential naming conflicts with blocksize in the future "assert(blockSize == 128);"

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done~

@a3213105 a3213105 self-requested a review July 21, 2025 06:36
@abenmao abenmao merged commit 820dab5 into intel:main Jul 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants