Description
`torch_xla/csrc/tensor_ops.cpp#L232` converts an `int64_t` `padding_idx` to `double`. This introduces FP64 ops even though `indices_rank1` is always `int32` or `int64`, while the upstream ATen implementation keeps everything in integer space.
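For illustration, here is a minimal standalone C++ sketch (illustrative only, with made-up variable names, not the actual torch_xla code path) of why the cast matters: once the scalar is widened to `double`, the comparison itself is promoted to FP64, which is what then gets lowered on the device.

```cpp
// Plain C++17 sketch -- illustrative only, not the torch_xla code path.
#include <cstdint>
#include <type_traits>

int main() {
  int64_t padding_idx = 3;  // stand-in for the padding index
  int32_t index = 3;        // stand-in for an element of indices_rank1

  // Mirrors the cast at tensor_ops.cpp#L232: the scalar becomes FP64.
  auto pad_as_double = static_cast<double>(padding_idx);

  // The usual arithmetic conversions then promote the whole comparison
  // to double, so the lowered compare is an FP64 op as well.
  static_assert(std::is_same_v<
                std::common_type_t<decltype(index), decltype(pad_as_double)>,
                double>);

  bool is_padding = (index == pad_as_double);  // FP64 comparison
  return is_padding ? 0 : 1;
}
```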
Questions
- Was the double cast introduced for a specific historical or hardware reason?
- How is this handled on devices without native FP64 support (e.g. TPU)?
Proposed fix
- Keep `padding_idx` as `int64_t`, or
- Cast it to the same dtype as `indices_rank1` before comparison.
Either option would avoid unnecessary FP64 operations and align with ATen’s behavior.
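As a rough sketch of the proposed direction (plain C++ with hypothetical names, not the actual torch_xla API), the mask can be built entirely in the indices' own integer dtype:

```cpp
// Plain C++17 sketch of the proposed fix -- hypothetical helper, not the
// torch_xla implementation: build the `!= padding_idx` mask in the
// indices' own integer dtype, so no FP64 op is ever emitted.
#include <cstddef>
#include <cstdint>
#include <vector>

template <typename IndexT>
std::vector<bool> BuildPaddingMask(const std::vector<IndexT>& indices,
                                   int64_t padding_idx) {
  // Cast the scalar to the indices' dtype (option 2); for int64 indices
  // this is a no-op, which is equivalent to option 1.
  const IndexT pad = static_cast<IndexT>(padding_idx);
  std::vector<bool> mask(indices.size());
  for (std::size_t i = 0; i < indices.size(); ++i) {
    mask[i] = (indices[i] != pad);  // integer compare, no double involved
  }
  return mask;
}

int main() {
  std::vector<int32_t> indices = {0, 3, 1, 3, 2};
  auto mask = BuildPaddingMask(indices, /*padding_idx=*/3);
  return mask[1] ? 1 : 0;  // mask == {1, 0, 1, 0, 1}
}
```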
Thanks for taking a look!