[AutoDeploy] Investigate torch.export as a preprocessing step to InferenceOptimizer #4704

Open
lucaslie opened this issue May 27, 2025 · 0 comments

Currently, we run torch.export on every rank:

  • This can overload the CPU, since every rank needs to run the export on the CPU.
  • It may cause significant slowdowns for models with long export times.

To spin this a little further:

Everything up to compile/cudagraph is very CPU-heavy, and, broadly speaking, the model is on the meta device while every rank performs the same set of computations. We may want to consider moving as much of this work as we can into the preprocessing stage (a rough sketch follows)...
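As a hedged sketch of what that could look like: run the export once on rank 0 (or in a separate preprocessing step), serialize the resulting ExportedProgram with torch.export.save, and have the remaining ranks deserialize it instead of re-tracing. The `ToyModel`, cache path, and file-based handoff below are hypothetical placeholders, not the actual AutoDeploy flow:

```python
import os

import torch
import torch.distributed as dist
import torch.nn as nn


class ToyModel(nn.Module):
    """Hypothetical stand-in for the real model built by the pipeline."""

    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(16, 16)

    def forward(self, x):
        return self.linear(x)


def export_once(cache_path: str) -> torch.export.ExportedProgram:
    """Run the CPU-heavy torch.export on rank 0 only; other ranks load the cached result."""
    rank = dist.get_rank() if dist.is_initialized() else 0
    if rank == 0 and not os.path.exists(cache_path):
        # In AutoDeploy the model would live on the meta device at this point;
        # the sketch uses CPU tensors to keep the example self-contained.
        model = ToyModel()
        example_inputs = (torch.randn(2, 16),)
        exported = torch.export.export(model, example_inputs)
        torch.export.save(exported, cache_path)
    if dist.is_initialized():
        # Make sure the cached program exists before the other ranks read it.
        dist.barrier()
    return torch.export.load(cache_path)
```

With something along these lines, only one process pays the export cost; the same idea could extend to the other meta-device graph transformations that currently run identically on every rank.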

@lucaslie lucaslie self-assigned this May 27, 2025