Feature/pytorch lightning integration -> research/ablation-study #1

Merged: MaloOLIVIER merged 32 commits into `research/ablation_study` from `feature/pytorch-lightning-integration` on Dec 6, 2024.
Conversation
- Refactored to use `LightningModule`
  - Updated training and validation steps with Lightning's hooks
  - Added PyTorch Lightning configurations
  - Need to further integrate Lightning and Hydra by leveraging Julien HAURET's VibraVox
- Add `HungarianDataModule` using `LightningDataModule`
  - Update `HNetGRULightning` to use torchmetrics for `F1Score`
  - Implement TensorBoard logging for loss and F1-score
  - Modify `train_hnet.py` to utilize the new `DataModule` and model classes
- **Enhanced Docstrings:** Added comprehensive documentation to the class and its methods, detailing attributes and functionalities.
  - **Refactored F1 Metrics:** Consolidated the separate F1 metric attributes into a single metric instance whose configuration is set dynamically.
  - **Added Type Annotations:** Included type hints for several methods to improve code clarity and type checking.
  - **Updated ModelCheckpoint Monitoring:** Changed the monitored parameter to align with updated logging keys.
  - **Enhanced Logging Mechanism:** Implemented a method to centralize the logging of loss and F1-score across training, validation, and test phases.
  - **Minor Imports Adjustment:** Updated import statements to include the names needed for type annotations.
- Update `test_step` method signature formatting in `models.py` for better readability
  - Simplify `on_test_start` by removing unnecessary `test_config` logging
  - Add `trainer.test` call in `train_hnet.py` to execute testing after training
…xt | removed docs from run.py
… | the `MetricCollection` is instantiated by Hydra and loads the `F1Score` | consequently, a dictionary of metrics is logged
…up configurations
- **Use Hydra to instantiate DataModule and LightningModule:** Replaced manual instantiation with `HungarianDataModule` and `HNetGRULightning`.
- **Remove hard-coded dataset paths:** Eliminated hard-coded `filename_train` and `filename_test` to rely on Hydra configurations.
- **Update imports:** Removed unused imports such as `datetime`, `random`, `sys`, and `warnings`; added necessary imports like `DictConfig` from `omegaconf`, `hydra` from `hydra.core`, and `Trainer` from `lightning.pytorch`.
- **Clean up Trainer instantiation:** Removed the `os.makedirs` and related directory-creation logic; updated `Trainer` instantiation to exclude the `gpus` argument and ensure proper callback management.
- **Enhance documentation:** Improved docstrings for better clarity and understanding of the main function's purpose.
- **Miscellaneous:** Added type hints for better code readability and maintenance.

**Benefits:**
- Enhances configurability and flexibility by leveraging Hydra's configuration management.
- Simplifies the `run.py` script by removing hard-coded values and unused components.
- Improves code maintainability and readability through better documentation and type hinting.
…ttentionLayer
- Rename fixture `model` to `hnetgru` in `conftest.py` for clarity
- Update `tests_consistency_hnet_gru.py` to use the renamed `hnetgru` fixture
- Add `test_AttentionLayer_init` to `tests_consistency_attention_layer.py`
- Remove tests from `tests_scenarios_attention_layer.py`
- isort + black
…ting**
- **.gitignore:** Changed `.coverage` to `.coverage*` to ignore all coverage-related files.
- **Configuration (`configs/run.yaml`):** Increased `nb_epochs` from 2 to 30 for more comprehensive testing; set `sample_range_used` as a hyphen-separated string `3000-5000-15000`.
- **Lightning Module (`hnet_gru_lightning.py`):** Added a default `device` parameter that automatically selects CUDA if available, otherwise CPU.
- **Pytest Configuration (`pytest.ini`):** Added a new marker `scenarios_generate_data` for tests that generate data during scenario-based testing.
- **Run Script (`run.py`):** Removed the `device` argument from `hydra.utils.instantiate` to handle device selection within the Lightning module itself.
- **Conftest (`tests/scenarios_tests/conftest.py`):** Removed the `cfg` fixture and custom Hydra override options to streamline configuration handling.
- **Test Scripts:**
  - **General Improvements:** Utilized `pathlib.Path` for more robust path manipulations.
  - **Specific Changes in `test_scenarios_run.py`:** Modified `sample_range_used` to be a hyphen-separated string; removed unused imports and cleaned up fixtures; ensured all tests assert successful completion with clear messages.

**Summary:** These changes enhance the testing framework by improving configuration flexibility, ensuring better handling of coverage files, and refining test scripts for reliability and readability. The updates facilitate more extensive and accurate testing scenarios, contributing to the robustness of the HNetGRU model development.
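The hyphen-separated `sample_range_used` string (`3000-5000-15000`) presumably gets split back into integers somewhere in the run script. A minimal sketch; the function name is hypothetical, not taken from the codebase:

```python
def parse_sample_range(spec: str) -> list[int]:
    """Split a hyphen-separated range string like '3000-5000-15000' into ints."""
    return [int(part) for part in spec.split("-")]


# parse_sample_range("3000-5000-15000") returns [3000, 5000, 15000]
```

Encoding the range as a single string keeps it override-friendly on the Hydra command line.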
This pull request includes several changes aimed at improving the configuration and data handling for the Hungarian Network project. The most significant updates involve adding new configuration files for various components and refactoring the dataset class.
Configuration Updates:
- `.github/workflows/ci-cd.yml`: Removed a TODO comment regarding upgrading the project to the latest Python version.
- `configs/callbacks/hnet_checkpoint.yaml`: Added a new configuration for the `ModelCheckpoint` callback to save the last model and the top-performing model based on validation loss.
- `configs/callbacks/rich_model_summary.yaml`: Introduced a new configuration for the `RichModelSummary` callback with a maximum depth of 3.
- `configs/logging/tensorboard.yaml`: Added a new configuration for logging using `TensorBoardLogger` with specified logging steps and directory.
- `configs/metrics/f1.yaml`: Added a new configuration for the `F1Score` metric, specifying it as a multiclass task with weighted averaging.
- `configs/run.yaml`: Created a top-level configuration file for `run.py`, defining various parameters and composing other configuration files.
- `configs/trainer/ddp.yaml`: Added a new configuration for the `Trainer` with distributed data parallel (DDP) strategy and GPU acceleration.

Data Handling Refactor:
- `hungarian_net/dataset.py`: Removed the old `HungarianDataset` class.
- `hungarian_net/lightning_datamodules/hungarian_datamodule.py`: Added a new `HungarianDataset` class with improved documentation and methods for handling class imbalance and weighted accuracy. Introduced a `HungarianDataModule` class to encapsulate data loading logic for training, validation, and testing.