Describe the feature🚀
现在Device Graph Runner只有同步语义,会阻塞当前的线程和stream,我们希望有一套兼容异步语义回调和runner forward的设计
|
assert self._copy_to_param_buffer(*args, **kwargs) |
References
https://docs.pytorch.org/docs/stable/generated/torch.cuda.Stream.html
https://docs.pytorch.org/docs/main/notes/cuda.html
Environment details
XpuGrpah: 0.10.1
Describe the feature🚀
现在Device Graph Runner只有同步语义,会阻塞当前的线程和stream,我们希望有一套兼容异步语义回调和runner forward的设计
xpu_graph/src/xpu_graph/device_graph_runner.py
Line 139 in fa37ad3
References
https://docs.pytorch.org/docs/stable/generated/torch.cuda.Stream.html
https://docs.pytorch.org/docs/main/notes/cuda.html
Environment details
XpuGrpah: 0.10.1