XLAShardedTensor.to_local() support #9505


Open
wants to merge 12 commits into master from toLocal_wspec

Conversation

Hoomaaan (Contributor)

The implementation adds a to_local() method to the XLAShardedTensor class that converts a sharded tensor back to its local representation while preserving gradient information. A sketch of the method's overall shape follows the list below.

  1. Core Functionality:
  • Returns the global tensor representation containing the combined data across all devices
  • Maintains the same device placement as the original XLAShardedTensor
  • Creates a clone of the global tensor to ensure data independence
  2. Gradient Handling:
  • Preserves the requires_grad setting from the original tensor
  • Maintains gradient values when converting to the local representation
  • Ensures proper gradient flow through the converted tensor
  3. Test Coverage:
  • Basic functionality test verifying shape and value preservation
  • Dedicated gradient flow test ensuring:
    • the requires_grad property is preserved
    • gradients are properly calculated and maintained
    • the backward pass works correctly through the local tensor
    • gradient values are accurately preserved
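A minimal sketch of the method, assembled from the snippets quoted in the review threads below; self.global_tensor is the attribute shown in the diff, and the rest is illustrative rather than the exact merged code:

def to_local(self):
    # Clone the combined global tensor so the returned tensor does not
    # share storage with the sharded tensor.
    result = self.global_tensor.clone()
    # The global tensor is detached, so restore the autograd state explicitly.
    if self.requires_grad:
        result.requires_grad_(self.requires_grad)
        # Clone the gradient as well so the two tensors do not share .grad
        # (added after review; see the thread below).
        if self.grad is not None:
            result.grad = self.grad.clone()
    return result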

Hoomaaan force-pushed the toLocal_wspec branch 2 times, most recently from f0c89b9 to 933a964 on July 24, 2025 at 20:39
Hoomaaan requested a review from bfolie on August 13, 2025 at 00:30
# Since global tensor is detached, add requires_grad and grad values back to the local tensor
if self.requires_grad:
    result.requires_grad = self.requires_grad
    result.grad = self.grad
Collaborator
The grad doesn't need to be cloned? Is it fine that we break the reference for the tensor but not its grad? See if we can add a test that shows whether that is the case (e.g. both the prior and newer tensor updating the same grad).
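A hedged sketch of the kind of check being asked for here; sharded stands in for an XLAShardedTensor whose .grad has already been populated by a backward pass, and the helper name is illustrative only, not the PR's actual test:

import torch

def grads_are_independent(sharded):
    # Convert to the local representation and snapshot the sharded
    # tensor's gradient before mutating anything.
    local = sharded.to_local()
    before = sharded.grad.clone()
    # Mutate the local tensor's gradient in place.
    local.grad.add_(1.0)
    # If to_local() copied the .grad reference without cloning it, the
    # in-place update above would also be visible on the sharded tensor.
    return torch.equal(sharded.grad, before)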

Contributor Author

Updated to clone self.grad when it is available.
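Presumably along these lines (a sketch of the follow-up, not the exact committed diff):

# Clone the gradient only when one exists, so the local tensor's .grad
# does not alias the sharded tensor's .grad.
if self.grad is not None:
    result.grad = self.grad.clone()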

result = self.global_tensor.clone()
# Since global tensor is detached, add requires_grad and grad values back to the local tensor
if self.requires_grad:
    result.requires_grad = self.requires_grad
Collaborator
nit: result.requires_grad_(self.requires_grad) for in-place.
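For reference, the two forms behave the same on a leaf tensor; requires_grad_() additionally returns the tensor, which allows chaining. Standalone example, unrelated to the PR's actual code:

import torch

t = torch.zeros(2, 3)
t.requires_grad = True                      # attribute assignment
u = torch.zeros(2, 3).requires_grad_(True)  # in-place setter, returns the tensor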

Contributor Author

Done!
