introducing perceiver-io style module for embedding layer #438

Draft
csjfwang wants to merge 12 commits into base: develop

Conversation

@csjfwang (Contributor) commented Jul 3, 2025

Description

For channel-wise attention on ERA5, we currently use only 96 channels. As more channels are incorporated in the future, the computational burden will increase significantly. To address this, we introduce a Perceiver IO module before the channel-wise attention stage.
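
A minimal sketch of the idea (not the PR's actual PerceiverBlock; names, shapes, and the use of MultiheadAttention as a stand-in are assumptions): a small set of learned latent queries cross-attends to the channel tokens, so the cost of the subsequent channel-wise attention depends on num_queries rather than on the growing number of ERA5 channels.

import torch


class PerceiverCompressor(torch.nn.Module):
    def __init__(self, dim_embed: int, num_queries: int, num_heads: int = 1):
        super().__init__()
        # Learned latents: (1, num_queries, dim_embed), broadcast over the batch.
        self.queries = torch.nn.Parameter(torch.empty(1, num_queries, dim_embed))
        torch.nn.init.normal_(self.queries, mean=0.0, std=dim_embed**-0.5)
        self.cross_attn = torch.nn.MultiheadAttention(dim_embed, num_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, num_channels, dim_embed) -> (batch, num_queries, dim_embed)
        q = self.queries.expand(x.shape[0], -1, -1)
        out, _ = self.cross_attn(q, x, x, need_weights=False)
        return out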

Type of Change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Documentation update

Issue Number

Open #225

Code Compatibility

  • I have performed a self-review of my code

Code Performance and Testing

  • I ran uv run train and (if necessary) uv run evaluate on at least one GPU node and it works
  • If the new feature introduces modifications at the config level, I have made sure to have notified the other software developers through Mattermost and updated the paths in the $WEATHER_GENERATOR_PRIVATE directory

Dependencies

  • I have ensured that the code is still pip-installable after the changes and runs
  • I have tested that new dependencies themselves are pip-installable.
  • I have not introduced new dependencies in the inference portion of the pipeline

Documentation

  • My code follows the style guidelines of this project
  • I have updated the documentation and docstrings to reflect the changes
  • I have added comments to my code, particularly in hard-to-understand areas

Additional Notes

wang85 and others added 12 commits June 12, 2025 16:56
…stream

replaced cross-attentionheadvarlen with cross-attentionhead
ecmwf#265)

* Adding base class for data readers, with anemoi data reader as example. Also implements (trivial) required changes in MultiStreamDataSampler.

* Ruffed

* Fixed various issues. Training with loss starting to converge. Much more testing needed.

* Fixing bug where cf.shuffle was not passed to data loader.

* Cleaned up problems that occurred with arbitrary length window and offsets. Also introduced base class for time stepped datasets and moved function to get dataset indices there. Other smaller code improvements and documentation.

* Changes due to changed interface in TimeWindowHandler

* Fixed typo

* Fixed documentation

* Renaming

* Adapted obs data reader

* [100] Refactor of data readers into clessig's branch (ecmwf#306)

* changes

* changes

* configs

* changes

* changes

* changes

* changes

* better interface

* comments

* changes

* Fixed formatting of comment

* Reenabled obs data reader

* Switched to use of TRAIN and VAL stages flags.

* Fixed problem with target and source channels that can differ between train and val

* Clean up and also added check_reader_data

* Fixed problem that occurred when dataset has no overlap with training / validation range. Also some restructuring to handle this properly.

* Added check for proper time inclusion in interval to check_data_reader. Code adaptations for handling of empty dataset.

* Fixed handling of subsampling_rate / frequency for anemoi dataset.

* Cleaned up special case handling and removed old comments

* Added missing handling of special case where t_win is in between two timesteps

* Renamed FESOM data reader and adapted to base class etc

* Fixed incorrect handling of shuffling

* Re-enabled FESOM data reader

* Added warning when dataset does not overlap time window of data loader

* Fixed missing check

* Adding fixed FESOM data reader. Needs to be verified

* Removing spurious files

* small changes

* style

* restored defaults

* Removing log messages that are too verbose

* Fixing sub-optimal solution with placement of warning

* Removing logging that breaks metric plotting (and seems not sensible)

* Reenabling performance logging

* removing incorrect formatting

* Removed unused variable

* Fixed bug in evaluation with missing usage of new stages

* Removing stream files that need to be considered in more detail before going to develop

* Removing debug dependency

* Restoring defaults

* Introduced stream_info to base class. Cleanup.

---------

Co-authored-by: Timothy Hunter <[email protected]>
@tjhunter marked this pull request as draft July 3, 2025 09:55
@clessig requested a review from sophie-xhonneux July 7, 2025 14:42
@@ -52,6 +77,29 @@ def __init__(
self.dim_embed = dim_embed
self.dim_out = dim_out
self.num_blocks = num_blocks
if cross_attn_params is not None:

For a couple of reasons it would be better to have an explicit flag to use Perceiver in EmbeddingNetwork than to check whether the config entry exists. When looking back at logs, it is easier to always check the same flag than to verify that something is not present.
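
A sketch of the suggested pattern (the flag name use_perceiver and this class layout are illustrative, not the PR's code): gate the Perceiver path on an explicit boolean rather than on the presence of cross_attn_params.

import torch


class EmbeddingNetworkSketch(torch.nn.Module):
    def __init__(self, dim_embed: int, num_queries: int, use_perceiver: bool = False):
        super().__init__()
        # One explicit flag to check (and to grep for in logs), rather than
        # inferring the mode from whether cross_attn_params is present.
        self.use_perceiver = use_perceiver
        if self.use_perceiver:
            self.queries = torch.nn.Parameter(torch.empty(1, num_queries, dim_embed))
            torch.nn.init.normal_(self.queries, mean=0.0, std=0.01)
            # ... build the PerceiverBlock here ...
        else:
            self.perceiver_io = torch.nn.Identity()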

num_channels = self.num_queries
self.cross_attn_num_blocks = cross_attn_params.get("num_blocks", 1) if cross_attn_params else 1
self.cross_attn_num_heads = cross_attn_params.get("num_heads", 1) if cross_attn_params else 1
self.queries = torch.nn.Parameter(

Pretty sure this code wasn't formatted

self.cross_attn_num_blocks = cross_attn_params.get("num_blocks", 1) if cross_attn_params else 1
self.cross_attn_num_heads = cross_attn_params.get("num_heads", 1) if cross_attn_params else 1
self.queries = torch.nn.Parameter(
torch.randn(1, self.num_queries, self.dim_embed)

These parameters should be initialised to be much smaller than randn, please use an explicit init strategy! For a gaussian, the standard deviation should be something like 0.01 to 0.0005, but really there should be an init_weights function that explicitly initialises all the weights.
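
A sketch of what such an explicit init could look like (the std value follows the comment above; the helper name init_weights is not in the PR):

import torch


def init_weights(module: torch.nn.Module, std: float = 0.01) -> None:
    # Small-gaussian init for linear layers; the learned queries are handled separately below.
    if isinstance(module, torch.nn.Linear):
        torch.nn.init.normal_(module.weight, mean=0.0, std=std)
        if module.bias is not None:
            torch.nn.init.zeros_(module.bias)


# e.g. called from the embedding network's __init__:
#   torch.nn.init.normal_(self.queries, mean=0.0, std=0.01)
#   self.apply(init_weights)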

self.queries.normal_(mean=0.0, std=1.0 / np.sqrt(self.dim_embed))
self.perceiver_io = PerceiverBlock(self.dim_embed, self.cross_attn_num_heads)
else:
self.perceiver_io = torch.nn.Identity()

Shouldn't we also set self.selu = torch.nn.Identity() here? Otherwise there is a difference between num_queries = 0 and no cross-attention parameters, which is odd.
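
A compact sketch of the suggested symmetry (MultiheadAttention stands in for the PR's PerceiverBlock; the class is a hypothetical reduction, not the PR's code):

import torch


class EmbeddingSketch(torch.nn.Module):
    def __init__(self, cross_attn_params=None, dim_embed: int = 128):
        super().__init__()
        if cross_attn_params is not None:
            self.selu = torch.nn.SELU()
            # stand-in for the PR's PerceiverBlock
            self.perceiver_io = torch.nn.MultiheadAttention(dim_embed, 1, batch_first=True)
        else:
            # Disable both modules together so that num_queries = 0 and a
            # missing cross_attn_params behave identically in forward().
            self.selu = torch.nn.Identity()
            self.perceiver_io = torch.nn.Identity()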

x = peh(self.selu(checkpoint(self.embed, x_in.transpose(-2, -1), use_reentrant=False)))


if self.num_queries > 0:

Please annotate the tensor shapes (batch dimension etc.); right now this is very hard to review/follow.
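
A sketch of the kind of annotation being asked for, applied to the lines above (the shape names are assumptions, not verified against the PR):

# x_in: (batch_cells, token_dim, num_channels); transpose so the embed MLP acts per channel
x = peh(self.selu(checkpoint(self.embed, x_in.transpose(-2, -1), use_reentrant=False)))
# x: (batch_cells, num_channels, dim_embed)

if self.num_queries > 0:
    # queries: (1, num_queries, dim_embed), expanded over the batch;
    # perceiver_io compresses num_channels -> num_queries,
    # output: (batch_cells, num_queries, dim_embed)
    ...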
