
Conversation

VeraChristina
Contributor

@VeraChristina VeraChristina commented Jul 8, 2025

This PR adds an ecFlow suite that can be deployed and monitored by a GitHub workflow. The purpose is to test the Anemoi pipeline end-to-end, from dataset creation to inference.

The suite added by this PR consists of

  • set-up
  • dataset creation
  • training

The suite is set up such that new test cases for dataset creation or training can be added as configs (no pyflow knowledge required).

The GitHub workflow

  • deploys and runs the suite
  • monitors whether it has finished or a task failed
  • prints results
  • cleans up the suite if the tests were successful

Before merging, we need to

  • document how to add new test cases
  • remove the pull request trigger and set a schedule for the regular action
  • draft issues for next steps
  • set the schedule: run nightly, and only run the tests if there have been commits in any of the repos tested (currently datasets and training)

Not part of this PR:

  • checks for the datasets task to verify that the created dataset looks as expected
  • periodic clean-up of directories created by tasks that are older than 2 days
  • inference family
  • fine-tuning family: forking, resuming, (rollout) etc.
  • review of the monitor action -- it should only finish when all parts are done/aborted/queued (currently it also finishes when only some tasks are aborted)
  • more test cases -- need to gather requirements for these

📚 Documentation preview 📚: https://anemoi--38.org.readthedocs.build/en/38/

@VeraChristina
Contributor Author

As discussed yesterday, I have added a workflow for nightly tests that first checks for commits within the past 24h in any of the repos (currently: anemoi-docs, anemoi-datasets, anemoi-core), and only runs the tests (on all mains) if there has been a change in any of them.
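
For illustration, here is a minimal sketch of that "only run if something changed" check, querying the GitHub REST API for commits on main in the last 24 hours; the actual workflow step may be implemented differently (e.g. with the gh CLI inside the action):

```python
# Illustrative sketch only -- not the actual workflow code.
import json
import urllib.request
from datetime import datetime, timedelta, timezone

# Repos named in the comment above; adjust as the suite grows.
REPOS = ["anemoi-docs", "anemoi-datasets", "anemoi-core"]
since = (datetime.now(timezone.utc) - timedelta(hours=24)).isoformat()

def has_recent_commits(repo: str) -> bool:
    # Unauthenticated requests are rate-limited; a real action would pass a token.
    url = f"https://api.github.com/repos/ecmwf/{repo}/commits?sha=main&since={since}"
    with urllib.request.urlopen(url) as resp:
        return len(json.load(resp)) > 0

if any(has_recent_commits(repo) for repo in REPOS):
    print("Changes in the last 24h -- run the nightly suite")
else:
    print("No changes -- skip this nightly run")
```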

Workflow run here: https://github.com/ecmwf/anemoi-docs/actions/runs/17433383922 (The monitor action fails here because the workflow already points to main of anemoi-docs where the test suite isn't added yet. This should be fine once merged.)

jjlk
jjlk previously approved these changes Sep 3, 2025

@jjlk jjlk left a comment


Minor optional comments, ok for me to merge as it is!

Contributor

@aaron-hopkinson aaron-hopkinson left a comment


A couple of minor comments – sorry!

@VeraChristina VeraChristina merged commit 0981601 into main Sep 5, 2025
5 checks passed
@VeraChristina VeraChristina deleted the test/system-level-prototype-ecflow branch September 5, 2025 14:27
@github-project-automation github-project-automation bot moved this from Under Review to Done in Anemoi-dev Sep 5, 2025
@MeraX
Contributor

MeraX commented Sep 5, 2025

If needed, we can look into ways of surfacing git hashes more directly, rather than just in the logs.

Thank you @VeraChristina, for your reply. Could you help me to find the git hashes in https://github.com/ecmwf/anemoi-docs/actions/runs/17433383922/job/49499118253

And why has this PR been merged without any ATS tag?

@anaprietonem
Contributor

If needed, we can look into ways of surfacing git hashes more directly, rather than just in the logs.

Thank you @VeraChristina, for your reply. Could you help me to find the git hashes in https://github.com/ecmwf/anemoi-docs/actions/runs/17433383922/job/49499118253

And why has this PR been merged without any ATS tag?

Hey @MeraX, good catch that we need to update the labels of this repo and also add the GitHub action so that it follows the same PR-labelling workflow as the other repos. I will help with this. Regarding ATS, this PR was discussed at ATS, where we also presented the overall approach. We clarified that the initial focus would be to support testing from main branches and then scale to the main use cases such as global, LAM and stretched grid. Since there were no major concerns and overall this was seen as a useful feature, we told Vera it was approved. Note that this doesn't mean the work on system-level tests is done; as @VeraChristina captured in https://github.com/ecmwf/anemoi-docs/issues?q=is%3Aissue%20state%3Aopen%20milestone%3A%22System-level%20tests%22, there are still improvements and features to be added. Nonetheless, having this merged is a great step towards making Anemoi more robust!

@VeraChristina
Contributor Author

VeraChristina commented Sep 8, 2025

Thank you @VeraChristina, for your reply. Could you help me to find the git hashes in https://github.com/ecmwf/anemoi-docs/actions/runs/17433383922/job/49499118253

Hi @MeraX, sure thing! -- You navigate to the summary, then open the logs for the package whose version you want to check, and scroll down to find the local version tag, which includes a short commit hash.

(Three screenshots showing the navigation from the workflow summary through the install logs down to the local version tag.)

Clearly, this is not ideal, since the information is buried deep in the logs and is only a short commit hash. Once we start using the test suite more comprehensively, i.e. beyond just testing all main branches nightly (which will make it much harder to know which versions were tested together), we will definitely need a better way of surfacing this.
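
For reference, a small illustrative sketch of how such a local version tag can be parsed; the version string below is the one visible in the logs, and the helper itself is hypothetical, not part of the suite:

```python
# Given a local version string as seen in the install logs
# (e.g. "0.1.dev1+gfa0b7e809"), extract the short git hash.
# The "+g<hash>" local segment is typically produced by setuptools-scm.
import re

def short_hash(version: str) -> str | None:
    match = re.search(r"\+g([0-9a-f]{7,40})", version)
    return match.group(1) if match else None

print(short_hash("0.1.dev1+gfa0b7e809"))  # -> "fa0b7e809"
```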

I've added a ticket to the milestone. Feel free to add any details or considerations I've missed!

@MeraX
Contributor

MeraX commented Sep 8, 2025

Hi Vera,

Thanks for the insight. It appears to me that this action does not test compatibility across all main branches; rather, the dependencies for each test are taken from PyPI. For example, one task tests anemoi-datasets==0.1.dev1+gfa0b7e809 while the core tests use anemoi-datasets==0.5.26 from PyPI. Is this intended?

From the anemoi-datasets tests:
(Screenshot of the install log from the anemoi-datasets tests.)

From the anemoi-core tests:

 + anemoi-datasets==0.5.26
 + anemoi-graphs==0.6.5.post3 (from file:///lus/h1resw02/project/prepml/ecflow_server/workdirs/testing/anemoi_tests/nightly/local/build/training_env/anemoi_training/graphs)
 + anemoi-models==0.9.4.post3 (from file:///lus/h1resw02/project/prepml/ecflow_server/workdirs/testing/anemoi_tests/nightly/local/build/training_env/anemoi_training/models)
 + anemoi-training==0.6.4.post3 (from file:///lus/h1resw02/project/prepml/ecflow_server/workdirs/testing/anemoi_tests/nightly/local/build/training_env/anemoi_training/training)
 + anemoi-transform==0.1.16
 + anemoi-utils==0.4.35

@VeraChristina
Contributor Author

VeraChristina commented Sep 8, 2025

@MeraX Thanks for pointing this out. What I meant by “across anemoi branches” is more about the end-to-end user workflow: creating a dataset → training → inference, with the various anemoi packages installed from main at each step. That way, we check whether training works when the dataset was created with the current anemoi-datasets (and soon, whether the resulting checkpoint is compatible with current inference).

So yes, this is intentional in the sense that it’s closer to the workflow we expect users to follow (installing training from main or PyPI, not mixing versions within the same environment). As you note, we may need to update dependencies, and when things break we probably want to see that, since we don’t do synchronized releases across the pipeline. That said, I agree we should have a way to test whether the pipeline with updated dependencies will work.
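
As an illustration only (not something the suite currently does), each test environment could report which anemoi packages were installed from a local checkout versus from an index, using the PEP 610 metadata that pip records at install time:

```python
# Hypothetical sketch: report the installed version and install origin of each
# anemoi package in the current environment.
import json
from importlib import metadata

PACKAGES = [
    "anemoi-datasets", "anemoi-graphs", "anemoi-models",
    "anemoi-training", "anemoi-transform", "anemoi-utils",
]

for name in PACKAGES:
    try:
        dist = metadata.distribution(name)
    except metadata.PackageNotFoundError:
        print(f"{name}: not installed")
        continue
    # PEP 610: pip writes direct_url.json for installs from a path/VCS/URL,
    # but not for installs from an index such as PyPI.
    direct = dist.read_text("direct_url.json")
    origin = json.loads(direct).get("url", "unknown") if direct else "index (e.g. PyPI)"
    print(f"{name}=={dist.version}  <- {origin}")
```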

This setup is not meant to be comprehensive — just a starting point. We’ll likely want to extend what’s tested over time, and it’d be great if others can share/document ideas. If you think the current approach should be changed, could you please open an issue so we can track the discussion there instead of on this closed PR?

@MeraX
Contributor

MeraX commented Sep 9, 2025

Thanks for this clarification, I might have missed that point of view in the discussion.

I believe there are myriad ways to set up Anemoi, and it's just important to make clear what has been proven to work and what has not.
