MPI shared memory #14

hmenke · 2023-09-26T17:01:29Z

All tests were run on multiple nodes, with a different number of slots on each node:

$ cat hostfile 
pcscqm04 slots=4
pcscqm05 slots=3
pcscqm06 slots=5
pcscqm07 slots=2
$ mpirun -hostfile ./hostfile build/test/c++/mpi_window

Some ideas:

~~MPI allocator~~

Similar to std::allocator, implement a shared_allocator and a distributed_shared_allocator, so that one can use, e.g., std::vector<double, mpi::shared_allocator<double>>.

Questions:
- ~~On top of that, for distributed shared memory, access must be fenced and broadcast between nodes. That's not so easy to abstract away.~~
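As a rough illustration of the allocator idea above, here is a minimal type that satisfies the C++ allocator requirements and can be plugged into std::vector. This is only a sketch, not the allocator from this PR: the names `shared_allocator_sketch` and `sum_with_sketch_allocator` are hypothetical, and `::operator new` stands in for a node-shared backend so that the example builds without MPI. An MPI version would presumably obtain its storage from something like MPI_Win_allocate_shared instead.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>
#include <new>
#include <vector>

// Hypothetical sketch, not the allocator from this PR.
// ::operator new / ::operator delete stand in for a shared-memory backend.
template <typename T> struct shared_allocator_sketch {
  using value_type = T;

  shared_allocator_sketch() = default;

  // Converting constructor required for rebinding to other element types.
  template <typename U> shared_allocator_sketch(shared_allocator_sketch<U> const &) noexcept {}

  T *allocate(std::size_t n) {
    // Guard against overflow in n * sizeof(T).
    if (n > std::numeric_limits<std::size_t>::max() / sizeof(T)) throw std::bad_alloc{};
    return static_cast<T *>(::operator new(n * sizeof(T)));
  }

  void deallocate(T *p, std::size_t) noexcept { ::operator delete(p); }
};

// The sketch allocator is stateless, so all instances compare equal.
template <typename T, typename U>
bool operator==(shared_allocator_sketch<T> const &, shared_allocator_sketch<U> const &) noexcept {
  return true;
}
template <typename T, typename U>
bool operator!=(shared_allocator_sketch<T> const &, shared_allocator_sketch<U> const &) noexcept {
  return false;
}

// Fill a vector that uses the sketch allocator and return the sum of its elements.
inline double sum_with_sketch_allocator(std::size_t n) {
  std::vector<double, shared_allocator_sketch<double>> v(n, 1.5);
  assert(v.size() == n);
  double sum = 0.0;
  for (double x : v) sum += x;
  return sum;
}
```

The container code stays unchanged; only the allocator template argument differs. The fencing and broadcasting concern raised in the question above is not addressed by this sketch.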

test/c++/mpi_allocator.cpp

c++/mpi/window.hpp

hmenke

Can we make this non-MPI compatible?

c++/mpi/window.hpp

hmenke

@Wentzell LGTM, only minor nitpick.

test/c++/mpi_window.cpp

Wentzell

Thank you @hmenke for this massive feature upgrade! The PR was already in great condition. I have iterated once on it, and from my side it is now ready for merge. Please confirm the two most recent changes in e45281f and c40fbee.

Wentzell · 2025-10-03T19:13:26Z

Thank you also @Thoemi09 for your improvements

hmenke

LGTM, but could you please not squash on merge? This makes it much easier to bisect later if something is wrong. You can clean up the history and rebase on unstable if you want.

Wentzell · 2025-10-03T19:43:31Z

> LGTM, but could you please not squash on merge? This makes it much easier to bisect later if something is wrong. You can clean up the history and rebase on unstable if you want.

I would have been OK with a full squash, but if you want to group things into a few commits along the way, feel free to suggest a grouping.

hmenke force-pushed the shm branch 3 times, most recently from 92e2e2b to 1738e29 Compare October 4, 2023 07:41

hmenke changed the title ~~WIP: MPI shared memory~~ MPI shared memory Oct 9, 2023

hmenke marked this pull request as ready for review October 9, 2023 10:00

hmenke force-pushed the shm branch from f009b85 to 879dc69 Compare October 9, 2023 10:04

hmenke commented Oct 9, 2023

test/c++/mpi_allocator.cpp (outdated, resolved)

hmenke force-pushed the shm branch from 87ead7d to 0bc73fa Compare October 25, 2023 16:40

hmenke force-pushed the shm branch from 7ff8df7 to 338e2a4 Compare July 3, 2024 15:00

hmenke commented Aug 16, 2024

c++/mpi/window.hpp (outdated, resolved)

hmenke commented Aug 16, 2024

c++/mpi/window.hpp (outdated, resolved)

hmenke commented Aug 16, 2024

c++/mpi/window.hpp (outdated, resolved)

hmenke mentioned this pull request Jan 22, 2025

MPI Window hmenke/mpi#1

Closed

Wentzell force-pushed the unstable branch from 442dac8 to 35eeb64 Compare February 20, 2025 22:05

hmenke force-pushed the shm branch 2 times, most recently from 7a4661b to 9b51e65 Compare February 24, 2025 10:49

Wentzell requested a review from Thoemi09 February 25, 2025 15:15

hmenke mentioned this pull request Mar 27, 2025

Node distributed Green's function mesh-products with global transpose operations TRIQS/triqs#949

Open

hmenke force-pushed the shm branch from 7b911d6 to 3f449d8 Compare July 11, 2025 09:42

hmenke requested a review from Wentzell July 11, 2025 10:29

hmenke force-pushed the shm branch from 3f449d8 to ca9026b Compare July 11, 2025 10:31

hmenke and others added 8 commits October 1, 2025 16:18

First prototype for windows and shared memory

6021278

Add simple shared array test

47a9d00

Add complicated distributed shared array test

f5d36b7

Cover some more MPI_Win_* API surface

2b1d0d9

Expand API surface and test cases

176de86

This adds a new abstraction for MPI_Group to be able to use the post-start-complete-wait RMA cycle. Also adds documentation and more tests.

Co-authored-by: Mohamed Aziz Bellaaj <[email protected]>

Add missing headers

651696e

shared communicator constructors

5d48c83

Inherit communicator constructors and pass by ref to window

59930f4

Thoemi09 and others added 7 commits October 1, 2025 16:18

Clean up synchronization ops in mpi::window

c0dbdef

- update docs
- wrap MPI calls with check_mpi_call
- remove noexcept if not necessary
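To illustrate the error-checking pattern this commit message refers to: MPI routines return an integer error code, and a thin wrapper can turn any non-success code into an exception. The sketch below is a guess at the general pattern, not the library's check_mpi_call. The names `check_call_sketch`, `fake_mpi_routine`, `synchronize_sketch`, and `throws_for`, as well as the message format, are hypothetical, and a plain function stands in for a real MPI routine so that the example builds without MPI. A function that calls such a wrapper can throw, which is one plausible reason for dropping noexcept.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>

// Stand-in for MPI_SUCCESS (0 in common MPI implementations).
constexpr int success_code = 0;

// Hypothetical wrapper: throw if a routine reports a non-success error code.
inline void check_call_sketch(int errcode, std::string const &routine) {
  if (errcode != success_code)
    throw std::runtime_error("error " + std::to_string(errcode) + " in routine " + routine);
}

// Hypothetical stand-in for an MPI routine; returns its argument as the error code.
inline int fake_mpi_routine(int code_to_return) { return code_to_return; }

// Deliberately not noexcept: the wrapped call may throw.
inline void synchronize_sketch(int code_to_return) {
  check_call_sketch(fake_mpi_routine(code_to_return), "fake_mpi_routine");
}

// Returns true if synchronize_sketch throws for the given error code.
inline bool throws_for(int code) {
  try {
    synchronize_sketch(code);
  } catch (std::runtime_error const &) {
    return true;
  }
  return false;
}
```

Marking `synchronize_sketch` as noexcept would turn the exception into a call to std::terminate, so the specifier is only appropriate on functions that cannot reach the wrapper.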

Clean up get and put methods in mpi::window

5c88faa

- update docs
- simplify implementation

Clean up getter methods in mpi::window

2799086

- update docs
- remove get_attr and return the stored attributes instead
- remove noexcept

Remove data() getters in mpi::window

d0f813c

Clean up mpi::shared_window

57df83e

- update docs
- remove noexcept when not necessary

Clean up in mpi_window.cpp

d920b1a

Add a multi-node CI job for MPI Shared Memory

f55b375

Wentzell force-pushed the shm branch from ca9026b to f55b375 Compare October 1, 2025 20:19

Wentzell and others added 7 commits October 1, 2025 16:52

Standardize member variable naming: com_ → comm_

60ba58b

For consistency with mpi::window which uses comm_, rename the communicator member variable from com_ to comm_ throughout the mpi::communicator class.

Minor renamings in build_multi_node workflow

b0b46d4

Minor correction in mpi::group::size() comment

0ae91e2

Fix noexcept issue in window.free() call

a575c43

Apply recent app4triqs changes to build_multi_node changes

de42f78

[window] Return number of elements from size() instead of bytes

2f5f740

- Change window<T>::size() to return element count for consistency
- Update test expectation accordingly

Co-Authored-By: Claude <[email protected]>
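The difference between the two conventions can be sketched as follows. MPI reports the size of a window in bytes, so an element count for a typed window is the byte size divided by sizeof(T). The struct `window_view` and the function `element_count_example` are hypothetical stand-ins, not the mpi::window class changed in this commit.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for a typed window; not the mpi::window class from this PR.
template <typename T> struct window_view {
  T *base = nullptr;           // start of the exposed memory
  std::size_t size_bytes = 0;  // size as MPI would report it, in bytes

  // Number of elements of type T that fit in the window.
  std::size_t size() const {
    assert(size_bytes % sizeof(T) == 0);
    return size_bytes / sizeof(T);
  }
};

// Expose a buffer of 8 doubles and return the element count seen through the view.
inline std::size_t element_count_example() {
  static double buffer[8];
  window_view<double> w{buffer, sizeof(buffer)};
  return w.size();
}
```

Returning the element count matches what standard containers do with size(), so a test that previously expected `n * sizeof(T)` would now expect `n`.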

Simplify SharedArray Tests

caa3938

hmenke commented Oct 3, 2025

test/c++/mpi_window.cpp (outdated, resolved)

test/c++/mpi_window.cpp (outdated, resolved)

test/c++/mpi_window.cpp (outdated, resolved)

test/c++/mpi_window.cpp (outdated, resolved)

Wentzell and others added 8 commits October 3, 2025 14:59

Remove redundant forward declaration

8cc623e

Fix issue with setting window::owned_ member

c40fbee

Remove redundant communicator.get call

e45281f

Minor clarification in window::query doc string

5e55cb2

Update test/c++/mpi_window.cpp

a8d26cf

Co-authored-by: Henri Menke <[email protected]>

Update test/c++/mpi_window.cpp

d4e7bdd

Co-authored-by: Henri Menke <[email protected]>

Update test/c++/mpi_window.cpp

279b427

Co-authored-by: Henri Menke <[email protected]>

Update test/c++/mpi_window.cpp

3525112

Co-authored-by: Henri Menke <[email protected]>

Wentzell approved these changes Oct 3, 2025

Fix construction of shared_comm in shared_window::get_communicator

ab83d92

hmenke commented Oct 3, 2025

get() and put() only make sense on the same rank without MPI

22b8dff
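One way to read this commit title: in a build without MPI there is a single process, rank 0, so a get or put can only target the caller's own memory and reduces to a local copy. The sketch below shows that idea under stated assumptions. The class `local_window`, its signatures, and `round_trip_example` are hypothetical and are not the code from commit 22b8dff.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical single-process fallback; not the code from this PR.
template <typename T> class local_window {
  std::vector<T> data_;

  // Reject any target other than rank 0 and any access outside the window.
  void check(std::size_t count, int target_rank, std::size_t disp) const {
    if (target_rank != 0) throw std::invalid_argument("without MPI only rank 0 exists");
    if (disp > data_.size() || count > data_.size() - disp) throw std::out_of_range("access outside window");
  }

 public:
  explicit local_window(std::size_t n) : data_(n) {}

  // Copy `count` elements from `origin` into the window, starting at element `disp`.
  void put(T const *origin, std::size_t count, int target_rank, std::size_t disp) {
    check(count, target_rank, disp);
    std::copy(origin, origin + count, data_.begin() + static_cast<std::ptrdiff_t>(disp));
  }

  // Copy `count` elements from the window, starting at element `disp`, into `origin`.
  void get(T *origin, std::size_t count, int target_rank, std::size_t disp) const {
    check(count, target_rank, disp);
    auto first = data_.begin() + static_cast<std::ptrdiff_t>(disp);
    std::copy(first, first + static_cast<std::ptrdiff_t>(count), origin);
  }
};

// Write two values into the window and read them back from the same rank.
inline bool round_trip_example() {
  local_window<int> w(4);
  int const in[2] = {7, 9};
  int out[2] = {0, 0};
  w.put(in, 2, 0, 1);
  w.get(out, 2, 0, 1);
  return out[0] == 7 && out[1] == 9;
}
```

Throwing on any non-zero target rank is a design choice of this sketch; a fallback could equally assert or ignore the rank.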
