You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This affects both fuse_gemm and fuse_moe. Possibly hard to fix if everything is done on the GPU due to memory fragmentation from re-allocating the fused weights...
fuse_gemm transformation at the moment consumes a lot of memory causing OOM errors. The fuse_gemm is disabled for now and we should fix it eventually
The text was updated successfully, but these errors were encountered: