Description
I see there is a DeviceMesh abstraction in spmd: https://github.com/pytorch/PiPPy/blob/main/spmd/tensor/device_mesh.py
Can we use this abstraction as shared infrastructure? For example, PipelineDriver.init_data_parallel
[https://github.com/pytorch/PiPPy/blob/877eb8c675dd0e34731961c043f8ae2cc1e49a77/pippy/PipelineDriver.py#L461] rolls a lot of these concepts by hand; can we use DeviceMesh here?
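To make the request concrete, here is a rough sketch of the kind of bookkeeping a mesh could centralize. It is plain Python and does not use the actual spmd `DeviceMesh` API or the real `PipelineDriver` code; `ToyMesh`, `groups_along`, and the (data-parallel, pipeline-stage) layout are hypothetical names and assumptions chosen only for illustration.

```python
from itertools import product
from typing import List, Sequence, Tuple


class ToyMesh:
    """Hypothetical stand-in for a device mesh: an n-d, row-major grid of global ranks."""

    def __init__(self, shape: Tuple[int, ...]) -> None:
        self.shape = shape

    def coords(self, rank: int) -> Tuple[int, ...]:
        """Return the row-major mesh coordinates of a global rank."""
        out = []
        for dim in reversed(self.shape):
            out.append(rank % dim)
            rank //= dim
        return tuple(reversed(out))

    def rank_at(self, coords: Sequence[int]) -> int:
        """Return the global rank located at the given mesh coordinates."""
        rank = 0
        for c, dim in zip(coords, self.shape):
            rank = rank * dim + c
        return rank

    def groups_along(self, axis: int) -> List[List[int]]:
        """Return rank groups that vary along `axis` and are fixed on every other axis."""
        other = [range(d) for i, d in enumerate(self.shape) if i != axis]
        groups = []
        for fixed in product(*other):
            group = []
            for k in range(self.shape[axis]):
                c = list(fixed)
                c.insert(axis, k)  # put the varying index back at `axis`
                group.append(self.rank_at(c))
            groups.append(group)
        return groups


# Assumed layout: axis 0 = data-parallel replicas, axis 1 = pipeline stages.
mesh = ToyMesh((2, 4))
dp_groups = mesh.groups_along(0)  # ranks holding the same stage in different replicas
pp_groups = mesh.groups_along(1)  # ranks forming one pipeline replica
```

With something like this, each entry of `dp_groups` could be handed to `torch.distributed.new_group(ranks=...)` instead of computing the rank lists by hand in the driver. Whether the spmd `DeviceMesh` exposes an equivalent per-dimension grouping is exactly the question here.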