
Conversation

casteryh (Contributor) commented Oct 25, 2025

Summary:
This change adds fallback support for RDMA operations when mlx5dv (the Mellanox device-specific extensions) is not available. It modifies the queue pair creation logic to conditionally use either extended mlx5dv-based queue pairs (when supported) or standard ibverbs queue pairs (as a fallback). The pt_cuda_alloc flag is updated to require mlx5dv support, since mlx5dv is necessary for merging memory segments when using PyTorch's CUDA allocator. The change adds a new is_extended parameter to control whether to create extended or standard queue pairs at runtime.

Also adds an env variable, MONARCH_RDMA_MLX5DV_DISABLED, to exercise the new code path on a dev machine (superseded by the configuration-based approach described below).
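
To make the fallback concrete, here is a minimal, self-contained Rust sketch of the selection logic described above; all names (QueuePair, DeviceCaps, create_queue_pair) are illustrative stand-ins, not monarch's actual types or API.

```rust
// Minimal sketch of the mlx5dv-or-standard fallback; names are illustrative.
#[derive(Debug, PartialEq)]
enum QueuePair {
    Mlx5dvExtended,  // extended QP created via mlx5dv verbs
    StandardIbverbs, // plain ibverbs QP, available on any RDMA device
}

struct DeviceCaps {
    // In the real code this would come from a capability probe such as
    // mlx5dv_is_supported(); hard-coded here for illustration.
    mlx5dv_supported: bool,
}

fn create_queue_pair(caps: &DeviceCaps) -> QueuePair {
    if caps.mlx5dv_supported {
        QueuePair::Mlx5dvExtended // preferred: enables memory segment merging
    } else {
        QueuePair::StandardIbverbs // fallback: works without Mellanox extensions
    }
}

fn main() {
    let caps = DeviceCaps { mlx5dv_supported: false };
    assert_eq!(create_queue_pair(&caps), QueuePair::StandardIbverbs);
}
```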

Changes in Latest Revision

Based on reviewer feedback, the implementation has been updated with a cleaner, configuration-based approach:

API Changes:

  • Replaced the uint8_t is_extended parameter with an rdma_qp_type_t enum in the C API
  • Added an RdmaQpType enum to Rust with three variants (sketched after this list):
    • Auto: Auto-detect based on device capabilities (default)
    • Standard: Force standard ibverbs queue pairs
    • Mlx5dv: Force mlx5dv extended queue pairs
  • Added a qp_type field to IbverbsConfig for explicit QP type control
  • The C code uses a switch statement with a proper default case for unknown types
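
A rough sketch of what the Rust side of this API could look like, assuming derive-based defaults; the real IbverbsConfig has more fields than shown here:

```rust
// Sketch of the Rust-side QP type selection API described above.
// Only the parts relevant to this change are shown.
#[derive(Clone, Copy, Debug, Default, PartialEq)]
enum RdmaQpType {
    #[default]
    Auto,     // auto-detect based on device capabilities
    Standard, // force standard ibverbs queue pairs
    Mlx5dv,   // force mlx5dv extended queue pairs
}

#[derive(Debug, Default)]
struct IbverbsConfig {
    qp_type: RdmaQpType,
    // ...other ibverbs settings elided
}
```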

Architecture:

  • Rust resolves Auto mode before calling C, making it the single source of truth for detection (see the sketch after this list)
  • The C function becomes a pure executor with no capability detection logic
  • Removed the environment variable approach in favor of configuration
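
Under that division of labor, the Auto resolution on the Rust side might look like the following sketch, reusing the RdmaQpType sketch above; resolve_qp_type and the device_supports_mlx5dv flag are hypothetical names, with the flag standing in for whatever capability probe the crate performs (e.g. mlx5dv_is_supported via FFI):

```rust
// Hypothetical sketch: collapse Auto to a concrete QP type before the FFI
// call, so the C layer only ever receives Standard or Mlx5dv.
fn resolve_qp_type(requested: RdmaQpType, device_supports_mlx5dv: bool) -> RdmaQpType {
    match requested {
        RdmaQpType::Auto if device_supports_mlx5dv => RdmaQpType::Mlx5dv,
        RdmaQpType::Auto => RdmaQpType::Standard,
        explicit => explicit, // honor an explicit choice from IbverbsConfig
    }
}
```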

Testing:

  • Added a setup_with_qp_type() helper function to the test utilities
  • Added 4 new unit tests to verify the standard QP fallback path (an illustrative shape is sketched after this list):
    • test_rdma_read_into_standard_qp (CPU-to-CPU)
    • test_rdma_write_from_standard_qp (CPU-to-CPU)
    • test_rdma_read_into_standard_qp_cuda (GPU-to-GPU)
    • test_rdma_write_from_standard_qp_cuda (GPU-to-GPU)
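
For illustration only, one of these tests could be shaped roughly as below; the stubbed setup_with_qp_type body and buffer types are invented here (reusing the RdmaQpType sketch above), not monarch's actual test code:

```rust
// Illustrative stub: the real helper builds two RDMA endpoints with the
// requested QP type; plain byte buffers stand in for them here.
fn setup_with_qp_type(_qp_type: RdmaQpType) -> (Vec<u8>, Vec<u8>) {
    (vec![0u8; 64], vec![42u8; 64])
}

#[test]
fn test_rdma_read_into_standard_qp() {
    // Force the standard-ibverbs path rather than relying on Auto detection.
    let (mut local, remote) = setup_with_qp_type(RdmaQpType::Standard);
    local.copy_from_slice(&remote); // stand-in for the RDMA read
    assert_eq!(local, remote);
}
```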

Differential Revision: D85504061

meta-cla bot added the CLA Signed label on Oct 25, 2025

meta-codesync bot commented Oct 25, 2025

@casteryh has exported this pull request. If you are a Meta employee, you can view the originating Diff in D85504061.

casteryh added a commit to casteryh/monarch that referenced this pull request Oct 27, 2025
casteryh added a commit to casteryh/monarch that referenced this pull request Oct 28, 2025
casteryh added a commit to casteryh/monarch that referenced this pull request Oct 29, 2025
casteryh added a commit to casteryh/monarch that referenced this pull request Oct 29, 2025
casteryh added a commit to casteryh/monarch that referenced this pull request Oct 30, 2025

meta-codesync bot commented Oct 30, 2025

This pull request has been merged in 434e447.

AlirezaShamsoshoara pushed a commit to AlirezaShamsoshoara/monarch that referenced this pull request Oct 30, 2025
Summary:
Pull Request resolved: meta-pytorch#1665

Reviewed By: dstaay-fb

Differential Revision: D85504061

fbshipit-source-id: a54466a309ff086eae96a63f7edf994655664826

Labels

CLA Signed, fb-exported, Merged, meta-exported
