Skip to content

Conversation

@hppritcha
Copy link
Member

the communicator in the message struct was not being added to the returned request.

Related to issue #13484

the communicator in the message struct was not being
added to the returned request.

Related to issue open-mpi#13484

Signed-off-by: Howard Pritchard <[email protected]>
@hppritcha
Copy link
Member Author

@devreal see if this helps for ASAN.

Note that i've not usually tried to run mpi4py with UCX PML. You have to skip test_ulfm among other things.

@devreal
Copy link
Contributor

devreal commented Jan 7, 2026

Thanks @hppritcha! It seems to fix the issue. I wonder why we don't need to retain the communicator though. It could be released by the application once they got the message object. Or is the communicator retained already?

I opened #13623 for the pml/ucx ULFM issue. We probably should start running the mpi4py test suite with UCX as well.

@hppritcha
Copy link
Member Author

UCX PML doesn't seem to like retaining/releasing the comm associated with a request. See for example the mca_pml_ucx_irecv function. a request object is "returned" from the ucp_tag_recv_nb UCX method.

@hppritcha
Copy link
Member Author

ignoring non-functional NVIDIA CI

@hppritcha hppritcha merged commit 6c47845 into open-mpi:main Jan 8, 2026
17 of 21 checks passed
@janjust
Copy link
Contributor

janjust commented Jan 8, 2026

@hppritcha does this need to be cherry-picked to v5.0?

@hppritcha
Copy link
Member Author

yes I think it should. on my to-do list!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants