GPU compute API abstraction on top of WebGPU #3238

daljit46 · 2025-11-26T13:01:00Z

This PR supersedes #3096.

It introduces a new GPU compute abstraction built on top of WebGPU. See #3096 for general motivation and design philosophy. A notable change from that PR is that shaders must now be written in Slang, a shading language created by NVIDIA (now under the umbrella of the Khronos Group). This choice is motivated by the fact that Slang provides useful features such as modules and generics, which improve code reuse and modularisation.

Some issues still need to be resolved (in future PRs):

- Deal with the issue raised in "Add option for canonical direct I/O layout" (#3108).
- Add more tests, especially for the upload/download of MR::Image instances to/from GPU (currently the API only supports MR::Image<float>).
- Create a real-world example that illustrates the use of the API.

This PR also adds a new tcb::span class (see #3219). Unlike other third-party dependencies, the class has been added directly to the codebase rather than fetched via CMake. The hope is that it will become redundant once we have access to C++20's std::span.

github-actions

clang-tidy made some suggestions

There were too many comments to post at once. Showing the first 25 out of 83. Check the log or trigger a new build to see more.

cpp/core/gpu/gpu.cpp

cpp/core/gpu/gpu.h

cpp/core/span.h

testing/unit_tests/gputests.cpp

github-actions

clang-tidy made some suggestions

There were too many comments to post at once. Showing the first 25 out of 58. Check the log or trigger a new build to see more.

testing/unit_tests/gputests.cpp

github-actions

clang-tidy made some suggestions

There were too many comments to post at once. Showing the first 25 out of 33. Check the log or trigger a new build to see more.

testing/unit_tests/gputests.cpp

github-actions

clang-tidy made some suggestions

testing/unit_tests/gputests.cpp

github-actions

clang-tidy made some suggestions

cpp/core/gpu/gpu.h

github-actions

clang-tidy made some suggestions

cpp/core/gpu/slangcodegen.cpp

cpp/core/gpu/slangcodegen.h

github-actions

clang-tidy made some suggestions

cpp/core/gpu/gpu.h

testing/unit_tests/gputests.cpp

Previously we looked up the requested entry point of a Slang shader by name, but then assumed entry point index 0 when:

- extracting WGSL and computing the shader-cache key
- reflecting bindings and compute workgroup size from ProgramLayout

With multiple entry points in the module, kernels could silently compile/cache WGSL for the wrong entry point or reflect bindings/workgroup size from the wrong entry point.

This fixes the problem by selecting the intended entry point from the linked slang::ProgramLayout by matching nameOverride/name, then using its index for WGSL extraction and hashing.

Also remove the TODOs regarding supporting multiple entry points. Our shader compilation requires that a given WebGPU kernel be tied to a single entry point, so supporting multiple entry points is not needed.
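As a rough illustration of the fix described above, the lookup can be sketched as a search over the reflected entry points, with the resulting index then used consistently for code extraction and hashing. This is a minimal sketch and not the code in slangcodegen.cpp: `EntryPointInfo` and `find_entry_point_index` are hypothetical names, the real reflection data comes from slang::ProgramLayout, and the rule that an override takes precedence over the declared name is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for the reflection data of one entry point.
struct EntryPointInfo {
  std::string name;          // name as declared in the shader source
  std::string name_override; // empty when no override is set
};

// Returns the index of the entry point matching `requested`, or -1 if none does.
// Assumption: a non-empty override takes precedence over the declared name.
inline int find_entry_point_index(const std::vector<EntryPointInfo> &entry_points,
                                  const std::string &requested) {
  for (std::size_t i = 0; i < entry_points.size(); ++i) {
    const EntryPointInfo &ep = entry_points[i];
    const std::string &effective = ep.name_override.empty() ? ep.name : ep.name_override;
    if (effective == requested)
      return static_cast<int>(i);
  }
  return -1; // caller should report an error instead of falling back to index 0
}
```

Returning -1 rather than defaulting to index 0 keeps a missing entry point from silently selecting the wrong kernel.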

github-actions

clang-tidy made some suggestions

cpp/core/gpu/slangcodegen.cpp

testing/unit_tests/gputests.cpp

daljit46 · 2026-01-19T12:14:24Z

I've now addressed all feedback in this PR. I've also removed all TODOs in the code by fixing the issues, except in the case of obtaining a Vulkan-backed WebGPU instance for MSYS2 environments (in which case I've just deleted the logic to manually copy the Vulkan dll). As mentioned earlier, this is not blocking, and we can investigate later what can be done about it.
All tests are passing, so this is ready for merging.

daljit46 · 2026-01-28T11:06:42Z

We discussed in today's meeting gating this PR (and future PRs related to GPU work) behind a new CMake configuration option, MRTRIX_ENABLE_GPU. This flag would enable/disable the fetching of the WebGPU and Slang dependencies (in addition to all the cpp files added here). The motivation is that we want to release a version 3.1.0 of MRtrix3 that will (may?) not include this work, and instead create a separate branch specifically targeting that release.
It's fairly straightforward to add this flag, but @jdtournier mentioned that it should be OFF by default. One downside is that this would disable CI tests for the GPU work (including future related PRs). So my proposal is instead to enable the flag by default and explicitly disable it in the new release branch. Alternatively, we could explicitly enable the flag in the CI workflows, but I think that's a bit unnecessary and more invasive.

daljit46 · 2026-02-04T16:22:01Z

@MRtrix3/mrtrix3-devs Unless there are any further comments regarding this PR, I will merge this tomorrow.

All feedback has been addressed

daljit46 self-assigned this Nov 26, 2025

daljit46 mentioned this pull request Nov 26, 2025

Experimental GPU-compute abstraction using WebGPU #3096

Closed

daljit46 requested review from a team and removed request for a team November 26, 2025 13:02

github-actions bot reviewed Nov 26, 2025

daljit46 force-pushed the webgpu branch from 6ecaa3f to 962afeb Compare November 26, 2025 13:36

github-actions bot reviewed Nov 26, 2025

cpp/core/gpu/gpu.h

github-actions bot reviewed Nov 26, 2025

cpp/core/gpu/gpu.h

github-actions bot reviewed Nov 26, 2025

cpp/core/gpu/gpu.h

daljit46 force-pushed the webgpu branch from aad3aec to 13e5410 Compare November 26, 2025 21:54

Add Slang + Dawn dependencies

69855b0

daljit46 force-pushed the webgpu branch from 97d59d7 to 3b4b99a Compare November 27, 2025 12:04

github-actions bot reviewed Nov 27, 2025

cpp/core/gpu/slangcodegen.cpp

cpp/core/gpu/slangcodegen.h

cpp/core/gpu/slangcodegen.h

cpp/core/gpu/slangcodegen.h

daljit46 force-pushed the webgpu branch 2 times, most recently from 4e205c0 to 24b7532 Compare November 27, 2025 14:25

daljit46 added 10 commits November 27, 2025 14:34

Add new GPU compute API

e89112d

This is a generic GPGPU compute abstraction built on top of WebGPU. To run operations on the GPU, shaders need to be written in the Slang programming language. See https://dawn.googlesource.com/dawn and https://shader-slang.org/

Add unit tests for GPU API

d7c70cc

Reorder arguments for FetchContent_Declare

c24a724

DOWNLOAD_EXTRACT_TIMESTAMP is only available on CMake >=3.24. This change makes CMake ignore the option on older versions.

Fix typos

9725022

Remove DOWNLOAD_EXTRACT_TIMESTAMP

f63dd43

We also set CMP0135 policy behaviour to NEW to fix warnings.

Don't run unit tests in parallel

0428640

Add Threads package requirement for Dawn integration

9d9111e

Update Dawn library path handling for Linux systems

1c58428

Update required cmake version in readme

3e763d5

Add mesa-vulkan-drivers to dependency installation for Linux builds

13a684b

daljit46 added 2 commits January 15, 2026 16:28

Remove redundant write_to_buffer overload with void*

ee18874

We no longer require this. If the user of the API wants to write arbitrary data into a GPU buffer, they construct a span object and use the other overload with `tcb::span`.
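The idea of wrapping arbitrary data in a span before writing it can be sketched as follows. To stay self-contained, this sketch does not use tcb::span or the real GPU API: `ByteView` is a hypothetical, minimal stand-in for a span of bytes, `as_bytes` is an illustrative helper, and `write_to_buffer_sketch` uses a std::vector in place of a GPU buffer, so none of these names or signatures should be read as the actual interface.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <type_traits>
#include <vector>

// Hypothetical stand-in for a span of const bytes: a non-owning (pointer, length) view.
struct ByteView {
  const unsigned char *data;
  std::size_t size;
};

// View any trivially copyable object as raw bytes without copying it.
template <typename T> inline ByteView as_bytes(const T &value) {
  static_assert(std::is_trivially_copyable<T>::value, "only trivially copyable data can be viewed as bytes");
  return ByteView{reinterpret_cast<const unsigned char *>(&value), sizeof(T)};
}

// Hypothetical span-based write: a std::vector plays the role of the GPU buffer.
inline void write_to_buffer_sketch(std::vector<unsigned char> &buffer, ByteView src, std::size_t offset = 0) {
  assert(offset + src.size <= buffer.size()); // the write must fit in the destination
  std::memcpy(buffer.data() + offset, src.data, src.size);
}
```

With a view type like this, a separate `void*` overload adds nothing: the caller states the pointer and the length together, and the callee can check bounds.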

Use constexpr for desired buffer size constants

9027f69

github-actions bot reviewed Jan 15, 2026

cpp/core/gpu/gpu.h

testing/unit_tests/gputests.cpp

testing/unit_tests/gputests.cpp

daljit46 and others added 13 commits January 15, 2026 17:48

Fix comment to correctly refer to Slang files instead of WGSL

44078e2

Align buffer writes to WebGPU requirements and add device limits

255a490

Include device limits in DeviceInfo and enforce 4-byte alignment for buffer write offsets and sizes. Validate uniform buffer offsets against minUniformBufferOffsetAlignment.
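The checks described in this commit message amount to simple modular arithmetic, sketched below. The function and constant names are hypothetical and are not taken from gpu.h. The 4-byte rule reflects WebGPU's requirement for buffer write offsets and sizes, and the uniform offset alignment is passed in as a parameter because the actual value comes from the device limits (minUniformBufferOffsetAlignment).

```cpp
#include <cassert>
#include <cstdint>

// WebGPU requires buffer write offsets and sizes to be multiples of 4 bytes.
constexpr std::uint64_t kBufferWriteAlignment = 4;

// True when `value` is a multiple of a non-zero `alignment`.
constexpr bool is_aligned(std::uint64_t value, std::uint64_t alignment) {
  return alignment != 0 && value % alignment == 0;
}

// Rounds `value` up to the next multiple of `alignment` (alignment must be non-zero).
constexpr std::uint64_t align_up(std::uint64_t value, std::uint64_t alignment) {
  return (value + alignment - 1) / alignment * alignment;
}

// Both the destination offset and the byte count must be 4-byte aligned.
constexpr bool is_valid_buffer_write(std::uint64_t offset, std::uint64_t size) {
  return is_aligned(offset, kBufferWriteAlignment) && is_aligned(size, kBufferWriteAlignment);
}

// Uniform buffer offsets must respect the limit reported by the device.
constexpr bool is_valid_uniform_offset(std::uint64_t offset, std::uint64_t min_uniform_buffer_offset_alignment) {
  return is_aligned(offset, min_uniform_buffer_offset_alignment);
}
```

A caller holding, say, 10 bytes of payload could pad the write to `align_up(10, 4)` bytes rather than have the write rejected.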

Mark new_buffer_from_host_image as [[nodiscard]]

9358f41

Add test for kernel using uniform buffer

e6fe1af

Fix inconsistent parameter names

b59c2d0

Fix formatting with clang-format

c2af099

Revert "Don't run unit tests in parallel"

20fc37f

This reverts commit 0428640.

Simplify selection of specialized entry point

9cda131

Co-authored-by: Robert Smith <[email protected]>

Avoid nested conditional operator when resolving entry point name

2c29bc8

Use uppercase unsigned literal suffixes

68103b9

Add missing include header

a725e1c

github-actions bot reviewed Jan 19, 2026

daljit46 added 4 commits January 19, 2026 11:46

Suppress check_syntax for resolved variable

4c94b1f

Fix narrowing conversion for Slang entry point

488c6a8

Remove hack to copy Vulkan dll on MSYS2

f6efe50

This solution doesn't seem to work in all cases. For example, it doesn't guarantee successfully creating the Vulkan-backed WebGPU instance in our CI tests. For now, we remove this and we'll investigate a more appropriate solution later.

Update copyright header for gputests.cpp

3e63d21

Add option to enable/disable GPU compute functionality

ad22afc

Lestropie mentioned this pull request Jan 29, 2026

Add option for canonical direct I/O layout #3108

Open

Merge branch 'dev' into webgpu

4617d68

daljit46 merged commit 3035c90 into dev Feb 5, 2026
5 of 6 checks passed

daljit46 deleted the webgpu branch February 5, 2026 07:30
