Why not use this MPI library for CPU-CPU data transfers? https://github.com/openucx/ucc
Why not use this MPI library for CPU-CPU data transfers?
https://github.com/openucx/ucc