-
Notifications
You must be signed in to change notification settings - Fork 363
Initial OpenACC port of mpas_atm_update_bdy_tend #1301
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Initial OpenACC port of mpas_atm_update_bdy_tend #1301
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The changes make sense and GPU runs of this PR and a reference before these changes match.
I'll hold off on full approval for any changes Michael requests or for any changes to the commit history that may come.
Thanks for the review! I'll address the comments, but just wanted to note that this PR may not be a priority for merging to |
8bc1e96
to
057ee7e
Compare
Addressed the review comments. I did have to rebase with the latest |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good, thanks! Once you've had a chance to rework the commit history, I'll go ahead and merge this.
This commit enables the GPU execution of the mpas_atm_update_bdy_tend routine using OpenACC directives for data movement and loops. A new timer has been added to time the host-device data transfers in this subroutine, with the label 'mpas_atm_update_bdy_tend [ACC_data_xfer]' This commit also introduces some integers for loop bounds, so as to dereference scalar integer pointers which the OpenACC parallel regions do not correctly copy to device memory.
dfaee24
to
fe2e528
Compare
Done. And thanks for the review! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good!
Initial OpenACC port of
mpas_atm_update_bdy_tend
.This port is required to keep
state
andtend
variables from LBCs eventually resident on GPUs.