Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

test: add integration tests for stretched and lam use cases #217

Draft
wants to merge 14 commits into
base: main
Choose a base branch
from

Conversation

VeraChristina
Copy link
Collaborator

@VeraChristina VeraChristina commented Mar 25, 2025

Do Not Merge

  • The added tests take 3 mins each when run locally but very long on the github runners, likely related to how we fetch the data. Need to resolve this before merging.

Description

  • Add two integration tests for small training cycles using
    • a stretched grid
    • a local area model
  • Add tests for config validation based on the same configs
  • Update file names for the global use case for consistency -- i.e. using the name of the template used (config.yaml)

Code Compatibility

  • I have performed a self-review of my code

Code Performance and Testing

  • I ran the complete Pytest test suite locally, and they pass
  • I have run the new integration tests locally, and they pass

Documentation

  • My code follows the style guidelines of this project
  • I have updated the documentation and docstrings to reflect the changes
  • I have added comments to my code, particularly in hard-to-understand areas

📚 Documentation preview 📚: https://anemoi-training--217.org.readthedocs.build/en/217/


📚 Documentation preview 📚: https://anemoi-graphs--217.org.readthedocs.build/en/217/


📚 Documentation preview 📚: https://anemoi-models--217.org.readthedocs.build/en/217/

@VeraChristina
Copy link
Collaborator Author

Tagging @anaprietonem because we wanted to revisit the file naming discussed on the first integration tests PR

@VeraChristina VeraChristina marked this pull request as ready for review March 27, 2025 11:24
@VeraChristina VeraChristina marked this pull request as draft March 27, 2025 14:49
@VeraChristina
Copy link
Collaborator Author

marking as draft while adding the lam use case to this PR as well

@VeraChristina VeraChristina changed the title test: add integration test for stretched grid use case test: add integration tests for stretched and lam use cases Mar 27, 2025
@VeraChristina VeraChristina marked this pull request as ready for review March 27, 2025 14:59
theissenhelen
theissenhelen previously approved these changes Mar 27, 2025
Copy link
Collaborator

@theissenhelen theissenhelen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Happy with these changes!

dataset:
cutout:
- dataset: ${hardware.paths.data}/${hardware.files.dataset}
thinning: 25 # ???-value in lam template
Copy link
Collaborator

@anaprietonem anaprietonem Mar 31, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is that a general value ? @JPXKQX views on this?

data: https://object-store.os-api.cci1.ecmwf.int/ml-tests/test-data/samples/anemoi-integration-tests
files:
dataset: regional-use-cases/cerra-rr-an-oper-0001-mars-5p5km-2017-2017-6h-v2-hmsi-testing.zarr
forcing_dataset: regional-use-cases/aifs-ea-an-oper-0001-mars-o96-2017-2017-6h-v8-testing.zarr
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For the forcing dataset, how long does this train currently takes? I believe for the global use case we were using a o48 dataset so wonder if that could make things faster

`training/tests/integration/test_training_cycle.yaml`. It furthermore
applies use-case-specific modifications as detailed in
`training/tests/integration/test_basic.yaml` to provide the location of
`training/tests/integration/config/testing_modifications.yaml`. It
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Regarding the naming:

  • the 'test_config.yaml' is the one related to the global deterministic model right? Could we more meaningful to call it test_global.yaml.
  • Related to this, in the future we would have uses cases related to probabilistic model or tests for the temporal interpolator. Any view on how to scale the naming conventions for those? Just wondering it could be good to do it already now with test_[coverage][model_task][forecaster_class]. Where coverage would be global/lam/stretched grid, model_task could be forecaster/interpolator and class would refer to single or probabilistic

Copy link
Collaborator

@anaprietonem anaprietonem left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great work both! I left some comments regarding views on scaling the naming convention and pinned Mario as he might has some views on the settings for LAM & stretched grid

@MeraX
Copy link
Contributor

MeraX commented Mar 31, 2025

Thank you for this inspiring PR. I have to minor questions, posted individually, and I'm wondering whether this test is supposed to run on a CPU?

overrides=overrides,
) # apply architecture overrides to template since they override a default
use_case_modifications = OmegaConf.load(Path.cwd() / "training/tests/integration/config/test_basic.yaml")
use_case_modifications = OmegaConf.load(Path.cwd() / "training/tests/integration/config/test_config.yaml")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is Path.cwd() relative to the working directory when executing pytest?

temp_dir = str(tmp_path)
testing_modifications.hardware.paths.output = temp_dir
return testing_modifications
def stretched_config(testing_modifications_with_temp_dir: OmegaConf) -> None:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the return type correct?

@VeraChristina VeraChristina marked this pull request as draft April 4, 2025 09:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Now In Progress
Development

Successfully merging this pull request may close these issues.

4 participants