Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TagParallelFor: Fix launch bounds #4391

Closed

Conversation

WeiqunZhang
Copy link
Member

We must pass block size as a template parameter to amrex::launch, otherwise the incorrect launch bounds will be used if AMREX_GPU_MAX_THREADS is set to be less than 256 at compile time.

@WeiqunZhang WeiqunZhang requested a review from atmyers March 25, 2025 22:22
@WeiqunZhang WeiqunZhang requested a review from asalmgren April 2, 2025 19:09
We must pass block size as a template parameter to amrex::launch, otherwise
the incorrect launch bounds will be used if AMREX_GPU_MAX_THREADS is set to
be less than 256 at compile time.
@WeiqunZhang
Copy link
Member Author

Pretty confident that this is ready for merge.

@WeiqunZhang WeiqunZhang enabled auto-merge (squash) April 3, 2025 18:40
@WeiqunZhang WeiqunZhang disabled auto-merge April 9, 2025 19:54
@WeiqunZhang WeiqunZhang closed this Apr 9, 2025
@WeiqunZhang WeiqunZhang deleted the max_threads_128 branch April 9, 2025 19:54
@WeiqunZhang
Copy link
Member Author

Merged into #4165

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants