
Conversation

@nefrathenrici (Member) commented Nov 14, 2025

Purpose

Closes #4046

Previously, the van Leer limiter triggered an InvalidIRError on CUDA when ITime was used. This PR fixes the issue by converting dt to an FT (the simulation float type) before passing it to ᶠlin_vanleer.
We currently call FT(dt) at many call sites; a follow-up PR could push these conversions into the corresponding methods.
No automated tests catch this, so perhaps we should use ITime more widely in CI.
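As a minimal, self-contained sketch of the conversion pattern (MyITime is a toy stand-in for the real ITime type, not ClimaUtilities code; the real fix simply applies FT(dt) at the call site):

```julia
# Toy stand-in for ITime: an integer tick counter plus a rational period.
struct MyITime
    counter::Int
    period::Rational{Int}   # seconds per tick
end

# Conversion at the call-site boundary: downstream code (e.g. a GPU
# kernel) only ever sees a plain float, never the wrapper type.
(::Type{FT})(t::MyITime) where {FT <: AbstractFloat} = FT(t.counter * t.period)

dt = MyITime(1, 1 // 1)   # "1secs", as in the reproducer config
Float32(dt)               # == 1.0f0, safe to pass into a CUDA kernel
```

The point is that the conversion happens once, at the boundary, so limiter kernels never have to compile against the ITime struct.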

Reproducer:

using Revise, Infiltrator

# Run on a single process with the CUDA device.
ENV["CLIMACOMMS_CONTEXT"] = "SINGLETON"
ENV["CLIMACOMMS_DEVICE"] = "CUDA"

import CUDA, ClimaAtmos as CA

config_dict = Dict(
    "apply_limiter" => true,
    "h_elem" => 12,
    "z_elem" => 25,
    "rayleigh_sponge" => false,
    "viscous_sponge" => false,
    "dt" => "1secs",
    "t_end" => "10secs",
    "log_progress" => false,
    "moist" => "nonequil",
    "surface_setup" => "DefaultMoninObukhov",
    "rad" => "allskywithclear",
    "vert_diff" => false,
    "precip_model" => "1M",
    "turbconv" => "diagnostic_edmfx",
    "edmfx_sgs_mass_flux" => true,
    "edmfx_sgs_diffusive_flux" => true,
    "edmfx_sgsflux_upwinding" => "vanleer_limiter",
    "edmfx_tracer_upwinding" => "vanleer_limiter",
    "use_itime" => true,
)
config = CA.AtmosConfig(config_dict)
sim = CA.get_simulation(config)

# With use_itime = true, dt is an ITime rather than a float.
dt = sim.integrator.dt
cρ = sim.integrator.u.c.ρ
f_u3 = sim.integrator.p.precomputed.ᶠu³
cχ = sim.integrator.u.c.ρq_tot ./ sim.integrator.u.c.ρ

# Materializing the broadcast compiles the van Leer limiter kernel;
# on main this throws an InvalidIRError.
Base.Broadcast.materialize(CA.vertical_transport(cρ, f_u3, cχ, dt, Val(:vanleer_limiter)))

Content


  • I have read and checked the items on the review checklist.

@trontrytel (Member)

Is it a fix to this issue: #4046? The error looks different?

@nefrathenrici (Member, Author)

> Is it a fix to this issue: #4046? The error looks different?

This PR should fix #4046; here is the error log from the current main.

@trontrytel (Member)

Should we maybe change one of the GPU CI cases to run with vanleer to test this?

@nefrathenrici (Member, Author)

> Should we maybe change one of the GPU CI cases to run with vanleer to test this?

Yes, do you have one in mind? We need to use itime with vanleer to test this.

@trontrytel (Member)

I don't know where itime is used or what it is. Any GPU spherical prognostic EDMF 1M simulation should do. Maybe we can add one to the CI or switch one of the 0M ones. It probably won't run for long right now, but at least we will be testing whether it's compiling.

@nefrathenrici (Member, Author) commented Nov 18, 2025

> I don't know where itime is used or what it is. Any GPU spherical prognostic EDMF 1M simulation should do. Maybe we can add one to the CI or switch one of the 0M ones. It probably won't run for long right now, but at least we will be testing whether it's compiling.

ITime is another way of representing time in a simulation. @ph-kev wrote it, so he can correct me, but I think the most important advantage of ITime over a Float64 is avoiding floating-point errors, particularly in long simulations.
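For intuition (a generic illustration, not ClimaUtilities code): naively accumulating a float time step drifts over many steps, while counting integer ticks stays exact.

```julia
# Naive float accumulation of dt = 0.1 (not exactly representable
# in binary) drifts over many steps.
function accumulate_float(dt, n)
    t = 0.0
    for _ in 1:n
        t += dt   # rounding error compounds every step
    end
    return t
end

t_float = accumulate_float(0.1, 1_000_000)

# Integer tick counting stays exact: convert to float only at the end.
ticks = 1_000_000
t_exact = ticks * (1 // 10)    # exact rational seconds

t_float == 100_000.0           # false: accumulated drift
float(t_exact) == 100_000.0    # true
```

This is why converting to FT only at the kernel boundary, rather than carrying floats through the whole time loop, preserves ITime's accuracy benefit.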

The issue here was that dt is an ITime, not a float, so it needed to be converted before being passed to the limiter. The FT(dt) pattern used in this fix should be cleaned up in a future PR, since the same conversion happens in many places, not just for the van Leer limiter.

I think we can just set use_itime: true in an existing GPU simulation. I can choose one and verify that it breaks on main without this fix.
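If an existing GPU case is switched over, the change might look like this (the file and surrounding keys are hypothetical; the option names come from the reproducer above):

```yaml
# Hypothetical additions to an existing GPU CI configuration file;
# only use_itime is new, the upwinding keys mirror the reproducer.
use_itime: true
edmfx_sgsflux_upwinding: "vanleer_limiter"
edmfx_tracer_upwinding: "vanleer_limiter"
```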

@trontrytel (Member)

Sounds good. Not sure if we have a

  • GPU
  • prognostic EDMF
  • 1M
  • van Leer transport for tracers

simulation in the CI. But if not, we should definitely have one.

@szy21 (Member) commented Nov 22, 2025

I think this fixes diagnostic + 1M + van Leer. It probably also fixes prognostic + 1M + van Leer. The problem with testing prognostic + 1M on GPU in CI is that it runs out of parameter memory on the central GPU.

Development

Successfully merging this pull request may close: GPU compilation error using 1M microphysics (#4046).