Cooperative vector API #141


Open · wjakob wants to merge 9 commits into master

Conversation

@wjakob wjakob (Member) commented Apr 18, 2025

This PR adds cooperative vector code generation support to Dr.Jit-Core. The main documentation and test suite of this feature are in the Dr.Jit parent project.

@wjakob wjakob (Member, Author) commented Apr 18, 2025

@merlinND I forgot to create a separate PR for this part; done now.

Wenzel Jakob and others added 7 commits April 22, 2025 07:55
This commit adds cooperative vector code generation support to Dr.Jit-Core. The main documentation and test suite of this feature are in the Dr.Jit parent project.

Dr.Jit previously chose the lowest possible PTX version for each compute capability, but this ended up being too restrictive. It now ships a table containing a full driver version -> PTX version mapping and searches it for the highest possible PTX version.

Dr.Jit can elide scatter operations when their result can no longer be referenced by any other operation. The logic to do so, and to decide when reference count decreases are needed, was dispersed throughout ``eval.cpp``. This commit simplifies the underlying code.
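For illustration, here is a minimal sketch of the driver version -> PTX version lookup described in the second commit above; the table entries, version encoding, and names are made up for the example and are not taken from this PR.

#include <cstdint>

struct PTXVersionEntry {
    uint32_t driver_version; // minimum driver version (illustrative encoding)
    uint32_t ptx_version;    // highest PTX ISA version known to that driver
};

// Sorted by driver version; the real table shipped with Dr.Jit-Core is larger.
static const PTXVersionEntry ptx_table[] = {
    { 11080, 78 }, // e.g. CUDA 11.8 -> PTX ISA 7.8
    { 12000, 80 }, // e.g. CUDA 12.0 -> PTX ISA 8.0
    { 12040, 84 }  // e.g. CUDA 12.4 -> PTX ISA 8.4
};

// Return the highest PTX version usable with the given driver version,
// falling back to the oldest entry if the driver predates the table.
inline uint32_t highest_ptx_version(uint32_t driver_version) {
    uint32_t result = ptx_table[0].ptx_version;
    for (const PTXVersionEntry &entry : ptx_table) {
        if (entry.driver_version > driver_version)
            break;
        result = entry.ptx_version;
    }
    return result;
}
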
@merlinND merlinND (Member) left a comment

Review part 1 (does not yet cover llvm/cuda_coop_vec.cpp).

uint32_t rows, cols; //< Shape of the matrix
uint32_t offset; //< Offset from the beginning of the buffer (in elements)
uint32_t stride; //< Row stride (in elements)
uint32_t size; //< Total size (in elements)

Could you add a note that, in some layouts, the size in elements may be larger than the actual number of matrix elements (e.g. when there is mandatory padding)?
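To make this concrete, here is a hedged illustration using the MatrixDescr fields quoted above; the numbers, the padded layout, and the size = rows * stride convention are assumptions made up for the example, not taken from this PR.

// Hypothetical example: describe a 3x3 matrix stored with a padded row
// stride of 16 elements, e.g. to satisfy an alignment requirement of the
// chosen layout.
MatrixDescr make_padded_3x3_descr() {
    MatrixDescr d;
    d.rows   = 3;
    d.cols   = 3;
    d.offset = 0;
    d.stride = 16;                // row stride in elements, larger than 'cols'
    d.size   = d.rows * d.stride; // 48 elements, although the matrix has only 9
    return d;
}
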

uint32_t out,
const MatrixDescr *out_descr);

/// Query the backend to compute the size of an array/vector in a given layout

Specify which fields of `in` are ignored and which fields of the output are filled in. Are the fields that this function does not compute copied over from `in` into the return value?

MatrixLayout layout,
uint32_t offset);

/// Perform a matrix-vector multiplication + bias addition

Specify which arguments are optional (e.g. `b_index = 0` for no bias, if that is allowed?)

int transpose);

/// Accumulate the coop. vector 'index' into the buffer 'target' with offset.
/// Potentially create a new buffer of size 'size' if target == 0.

Suggested change
/// Potentially create a new buffer of size 'size' if target == 0.
/// Potentially create a new buffer of size 'target_size' if target == 0.


~CoopVecPackData() {
for (uint32_t index: indices)
jitc_var_dec_ref(index);

It looks like the variables don't have their reference count increased when building a CoopVecPackData, but it does get decreased upon destruction.

If it's not too heavy, would it be worth putting CoopVecPackData construction behind a static method `CoopVecPackData::steal(indices)` so that the ownership is clear?
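A rough sketch of what the suggested factory could look like, assuming CoopVecPackData simply stores its indices in a std::vector<uint32_t> (an assumption for this sketch; only the destructor and jitc_var_dec_ref appear in the excerpt above):

#include <cstdint>
#include <utility>
#include <vector>

// Stand-in declaration for the sketch; the real function is provided by Dr.Jit-Core.
extern void jitc_var_dec_ref(uint32_t index);

struct CoopVecPackData {
    std::vector<uint32_t> indices;

    /// Take ownership of the given variable indices *without* increasing
    /// their reference counts (mirroring the usual steal() semantics).
    static CoopVecPackData steal(std::vector<uint32_t> indices) {
        CoopVecPackData result;
        result.indices = std::move(indices);
        return result;
    }

    // Moving is fine (the moved-from vector becomes empty); copying is not.
    CoopVecPackData(CoopVecPackData &&) = default;
    CoopVecPackData(const CoopVecPackData &) = delete;

    ~CoopVecPackData() {
        for (uint32_t index : indices)
            jitc_var_dec_ref(index);
    }

private:
    CoopVecPackData() = default;
};
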


As an alternative, I think there was a small specialized DrJit vector (list) class that automatically increases the ref count?


if (!(a0_v->size == a1_v->size || a1_v->size == 1 || a0_v->size == 1))
jitc_raise(
"jit_coop_vec_binary_op(): incompatible thread count (%u and %u)!",

Suggested change
"jit_coop_vec_binary_op(): incompatible thread count (%u and %u)!",
"jit_coop_vec_binary_op(): incompatible width (%u and %u)!",

To stay consistent with dr::width()?

!(a1_v->size == max_size || a1_v->size == 1) ||
!(a2_v->size == max_size || a2_v->size == 1))
jitc_raise(
"jit_coop_vec_ternary_op(): incompatible thread count (%u, %u, and %u)!",

Suggested change
"jit_coop_vec_ternary_op(): incompatible thread count (%u, %u, and %u)!",
"jit_coop_vec_ternary_op(): incompatible width (%u, %u, and %u)!",


if (!supported)
jitc_raise("jit_coop_vec_matvec(): incompatible input types "
"(currently, only float16 is supported on the CUDA/OptiX)!");

Suggested change
"(currently, only float16 is supported on the CUDA/OptiX)!");
"(currently, only float16 is supported on the CUDA/OptiX backend,"
" and only float16 and float32 on the LLVM backend.");

Because people might try float64.

Comment on lines +564 to +571
jitc_var(result)->data = cvmvd.get();
jitc_var_set_callback(
result,
[](uint32_t, int free, void *p) {
if (free)
delete (CoopVecMatVecData *) p;
},
cvmvd.release(), true);

Do you think it would make sense to refactor this common pattern into a helper like `jitc_var_new_with_payload<T>(...)`?
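Here is a hedged sketch of what such a helper could look like, based on the pattern quoted above; the helper's name and its std::unique_ptr-based signature follow the reviewer's suggestion rather than code in this PR, and it assumes the Dr.Jit-Core declarations of jitc_var() and jitc_var_set_callback() are in scope.

#include <cstdint>
#include <memory>

// Hypothetical helper: attach a heap-allocated payload of type T to an
// existing variable and register a callback that deletes it once the
// variable is freed, mirroring the snippet above.
template <typename T>
void jitc_var_set_payload(uint32_t index, std::unique_ptr<T> payload) {
    jitc_var(index)->data = payload.get();
    jitc_var_set_callback(
        index,
        [](uint32_t /* index */, int free, void *p) {
            if (free)
                delete (T *) p;
        },
        payload.release(), true);
}
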


void *p = nullptr;
Ref tmp = steal(jitc_var_data(target, false, &p));
Ref target_ptr = steal(jitc_var_pointer(backend, p, tmp, 1));

Does the current implementation support multiple `outer_product_accumulate` calls targeting the same matrix?
It's quite common to evaluate the same MLP multiple times in the same forward pass (e.g. when taking finite-difference gradients of a neural field), so the backward pass would contain multiple outer products targeting the same matrix that we want to keep in the same kernel.
