torch_xla::runtime::PjRtComputationClient::ExecuteReplicated() crash after RuntimeError #8973

Open
tengyifei opened this issue Apr 14, 2025 · 0 comments
Labels: bug (Something isn't working), xla:tpu (TPU specific issues and PRs)

tengyifei (Collaborator) commented Apr 14, 2025

See crash at: https://github.com/tengyifei/playground/blob/master/torch-xla-device.ipynb

Minimal repro:

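import torch
import torch_xla

# Place two tensors on different XLA devices.
a = torch.tensor([1.0], device="xla:1")
b = torch.tensor([1.0], device="xla:2")
print(a.device, b.device)

# Add tensors that live on different XLA devices.
a + b

# Add an XLA tensor and a CPU tensor.
a + b.cpu()

# Switch the runtime to SPMD mode.
import torch_xla.runtime as xr
xr.use_spmd()

# Create a tensor on the default XLA device and use it.
c = torch.tensor([1.0], device="xla")
c.device, c + c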
ysiraichi added the bug and xla:tpu labels Apr 17, 2025