
fix: Kcrps fixes for Pydantic schemas #189

Open · wants to merge 21 commits into base: kcrps

Conversation

anaprietonem (Collaborator)

Description

Type of Change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Documentation update

Issue Number

Code Compatibility

  • I have performed a self-review of my code

Code Performance and Testing

  • I have added tests that prove my fix is effective or that my feature works
  • I ran the complete Pytest test suite locally, and it passes
  • I have tested the changes on a single GPU
  • I have tested the changes on multiple GPUs / multi-node setups
  • I have run the Benchmark Profiler against the old version of the code
  • If the new feature introduces modifications at the config level, I have made sure to update Pydantic Schemas and default configs accordingly

Dependencies

  • I have ensured that the code is still pip-installable and still runs after the changes
  • I have tested that new dependencies themselves are pip-installable.
  • I have not introduced new dependencies in the inference portion of the pipeline

Documentation

  • My code follows the style guidelines of this project
  • I have updated the documentation and docstrings to reflect the changes
  • I have added comments to my code, particularly in hard-to-understand areas

Additional Notes

@anaprietonem changed the title from "fixes to default config and shemas" to "fix: Kcrps fixes for Pydantic schemas" on Mar 14, 2025
@anaprietonem marked this pull request as ready for review on March 20, 2025 08:52
@theissenhelen self-requested a review on March 24, 2025 11:05
@theissenhelen previously approved these changes on Mar 28, 2025
num_gpus_per_model: PositiveInt = Field(example=2)
"Number of GPUs per model."
# TODO(Ana): check why the read_group_size is not passed in the config
kwargs: dict[str, Any] = Field(default_factory=dict)
Collaborator
Just as a note, we'll have to keep in mind that the kwargs will not be validated. We might want to look at including them in the config validation at a later stage.
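
To make the note concrete, here is a minimal sketch (Pydantic v2; the schema and key names are hypothetical stand-ins, not the project's) of the behaviour being flagged: an open dict[str, Any] field accepts any keys and values, while promoting the known keys to a nested model would catch mistakes at config-load time.

from typing import Any

from pydantic import BaseModel, ConfigDict, Field, PositiveInt


# Hypothetical stand-in for the current schema: kwargs is an open dict,
# so Pydantic accepts any keys and values without complaint.
class StrategySchema(BaseModel):
    num_gpus_per_model: PositiveInt = 2
    kwargs: dict[str, Any] = Field(default_factory=dict)


StrategySchema(kwargs={"read_group_sise": "oops"})  # validates silently


# One option for validating kwargs at a later stage: promote the known
# keys to a nested model and forbid unknown ones.
class StrategyKwargs(BaseModel):
    model_config = ConfigDict(extra="forbid")
    read_group_size: PositiveInt = 1


class ValidatedStrategySchema(BaseModel):
    num_gpus_per_model: PositiveInt = 2
    kwargs: StrategyKwargs = Field(default_factory=StrategyKwargs)


# ValidatedStrategySchema(kwargs={"read_group_sise": "oops"})  # raises ValidationError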

"Sanity check runs n batches of val before starting the training routine."
gradient_clip: GradientClip
"Config for gradient clipping."
-forecaster: Any  # TODO: Fix this
+forecaster: Any  # TODO(Simon): Fix this
Collaborator
Is there a plan to create a schema for the forecaster at some point?
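
Not part of this change, but as a hedged sketch of what replacing forecaster: Any with a typed model might look like (the field names are illustrative assumptions, not the project's):

from pydantic import BaseModel, NonNegativeInt, PositiveInt


# Illustrative only: a typed model here would let Pydantic validate the
# forecaster block instead of accepting anything via Any.
class ForecasterSchema(BaseModel):
    rollout: PositiveInt = 1
    "Number of rollout steps during training."
    warmup_steps: NonNegativeInt = 0
    "Number of steps before the rollout schedule starts."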

@theissenhelen dismissed their stale review on March 28, 2025 13:56

Test failing.

@@ -25,7 +27,7 @@ model:
grid_skip: 0 # Which of the input indices to use as residual connection, null if none.

training:
-  task: anemoi.training.train.interpolator.GraphInterpolator
+  model_class: anemoi.training.train.interpolator.GraphInterpolator
Member
I think model_task is clearer, would need changing across multiple configs.
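
If the rename goes ahead, one transitional option (a sketch under assumptions, not the project's API; the schema name and default shown are illustrative) is a Pydantic validation alias so configs using the old key keep validating during the migration:

from pydantic import AliasChoices, BaseModel, Field


# Sketch of a transitional schema: accept the old keys ("task",
# "model_class") and the proposed "model_task" for the same field.
class TrainingSchema(BaseModel):
    model_task: str = Field(
        default="anemoi.training.train.interpolator.GraphInterpolator",
        validation_alias=AliasChoices("model_task", "model_class", "task"),
    )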

@@ -58,7 +58,7 @@ processor:
mlp_hidden_ratio: 4 # GraphTransformer or Transformer only
num_heads: 16 # GraphTransformer or Transformer only
cpu_offload: ${model.cpu_offload}
-  qk_norm: False
+  qk_norm: True
Member
Why change the behaviour?

anaprietonem (Collaborator, Author)

Need to check this with Simon and Helen. Looking at the settings, qk_norm is kept true for the processor in both the transformer (ens and single) and the graph_transformer (ens), so I thought this was an error that needed correcting. I will check with them before merging.
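
Whichever value is agreed, this ties back to the checklist item above: the Pydantic schema default and the YAML configs should stay in sync. A minimal sketch, assuming True is the intended behaviour (the class name is hypothetical):

from pydantic import BaseModel


# Sketch only: pinning the agreed default in the schema keeps configs
# that omit qk_norm consistent with the ens/single transformer settings.
class ProcessorSchema(BaseModel):
    qk_norm: bool = True
    "Whether to apply query-key normalisation in attention."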

Projects
Status: Now In Progress

4 participants