feat: Add Interleaved Trainer implementation #3107

ucalyptus2 · 2025-03-18T16:22:31Z

What does this PR do?

This PR introduces a new InterleaveTrainer class that enables alternating between different training strategies within the same training loop. This implementation allows for more flexible training patterns where different optimization objectives can be interleaved during model training.

Key additions:

Add InterleaveTrainer class and configuration
- Implements a trainer that can alternate between different training strategies
- Provides configurable scheduling of training phases
- Supports seamless integration with existing TRL trainers
Add unit tests for interleaved training
- Comprehensive test coverage for trainer functionality
- Tests for configuration validation
- Integration tests with different training scenarios
Update init.py files to expose new trainer
- Make InterleaveTrainer accessible through the main TRL package
- Maintain consistent import patterns with other trainers
Implement trainer configuration with InterleaveConfig
- Flexible configuration options for defining training schedules
- Support for customizing phase transitions
- Type-safe configuration validation

Technical Details

The InterleaveTrainer allows users to define multiple training phases that can be alternated during the training process. This is particularly useful for scenarios where you want to:

Alternate between different learning objectives
Switch between different datasets during training
Implement curriculum learning strategies
Balance multiple training goals in a controlled manner

Before submitting

Did you read the contributor guideline
Did you write any new necessary tests?
Did you make sure to update the documentation with your changes?

Who can review?

Anyone familiar with TRL's trainer implementations and interested in advanced training strategies. @huggingface/trl-core-team would be great reviewers for this feature.

- Add InterleaveTrainer class and configuration - Add unit tests for interleaved training - Update __init__.py files to expose new trainer - Implement trainer configuration with InterleaveConfig

qgallouedec · 2025-04-02T16:56:14Z

Hi, thanks for your contribution! I’m unsure if the potential profit outweighs the added complexity. For now, I’m putting this on hold to gauge community interest in supporting this feature.

ucalyptus and others added 2 commits March 18, 2025 16:20

feat: Add Interleaved Trainer implementation

3f35045

- Add InterleaveTrainer class and configuration - Add unit tests for interleaved training - Update __init__.py files to expose new trainer - Implement trainer configuration with InterleaveConfig

Merge branch 'huggingface:main' into main

865665e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add Interleaved Trainer implementation #3107

feat: Add Interleaved Trainer implementation #3107

ucalyptus2 commented Mar 18, 2025

qgallouedec commented Apr 2, 2025

feat: Add Interleaved Trainer implementation #3107

Are you sure you want to change the base?

feat: Add Interleaved Trainer implementation #3107

Conversation

ucalyptus2 commented Mar 18, 2025

What does this PR do?

Technical Details

Before submitting

Who can review?

qgallouedec commented Apr 2, 2025