eckit::linalg::sparse::LinearAlgebraTorch backend #165

pmaciel · 2025-02-18T14:52:12Z

This PR adds a sparse lienar algebra backend to allow for GPU-based matrix multiplications (and other operations), which translates into a significant performance increase for interpolations, in the right conditions (right environment, advanced use of mir.)

It makes use of a deployed version of PyTorch (findable by CMake), specifically its lower level component "Torch", which is part of the same package (this is how it is released to the public.) I've exposed all possible hardware configuration options, contemporary. But obviously, the better development is to improve the whole workflow to avoid copies to/from the CPU/GPU, so this develolpment is purelly a stepping stone -- it has already allowed me to run both mars-client (C) and pgen on GPUs, and of course mir. It would be great to follow this up with a publication. Possibly, this could be configurable in earthkit-regrid for maximum marketing :-)

I've held back this development for several months, but I couldn't find a definite response on when to post it -- here it is.

codecov-commenter · 2025-02-18T14:59:37Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 64.09%. Comparing base (f1591f4) to head (b548c96).

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #165      +/-   ##
===========================================
- Coverage    64.09%   64.09%   -0.01%     
===========================================
  Files         1083     1083              
  Lines        55759    55759              
  Branches      4120     4120              
===========================================
- Hits         35738    35736       -2     
- Misses       20021    20023       +2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

wdeconinck · 2025-02-19T16:13:16Z

This looks neat!
I'd like to test this before merging, preferably on our HPC.
Could you get a working Torch that I could link into e.g. with a setup using the nvidia compiler?

As a side, it would be nice to clean up and remove unused backends (armadillo, viennacl, ... )

pmaciel requested review from wdeconinck, tlmquintino, sandorkertesz, iainrussell and danovaro February 18, 2025 14:52

pmaciel force-pushed the feature/torch branch from 75c7d83 to 06d774e Compare February 19, 2025 21:07

eckit::linalg::sparse::LinearAlgebraTorch backend

b548c96

pmaciel force-pushed the feature/torch branch from 06d774e to b548c96 Compare February 20, 2025 09:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

eckit::linalg::sparse::LinearAlgebraTorch backend #165

eckit::linalg::sparse::LinearAlgebraTorch backend #165

pmaciel commented Feb 18, 2025 •

edited

Loading

codecov-commenter commented Feb 18, 2025 •

edited

Loading

wdeconinck commented Feb 19, 2025

eckit::linalg::sparse::LinearAlgebraTorch backend #165

Are you sure you want to change the base?

eckit::linalg::sparse::LinearAlgebraTorch backend #165

Conversation

pmaciel commented Feb 18, 2025 • edited Loading

codecov-commenter commented Feb 18, 2025 • edited Loading

Codecov Report

wdeconinck commented Feb 19, 2025

pmaciel commented Feb 18, 2025 •

edited

Loading

codecov-commenter commented Feb 18, 2025 •

edited

Loading