Add tip for logging evaluation metrics during regular evaluations #4367

cam1llynha · 2025-10-29T12:54:43Z

This PR adds guidance to the DPO script on how to log and save evaluation metrics at each regular evaluation, not only at the end of training.
It includes an example of a custom callback (`TrainerCallback`) that logs intermediate metrics, and it clarifies how W&B (Weights & Biases) aggregates metrics across the evaluation dataset.
The goal is to give users a clear reference for monitoring metrics throughout training without changing the core logic.
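
For readers who want a concrete picture of the kind of callback described above, here is a minimal sketch. It is not the code from this PR: the class name `SaveEvalMetricsCallback`, the `output_path` parameter, and the JSON Lines file format are all hypothetical choices made for illustration. It relies only on the `on_evaluate` hook of `TrainerCallback` in transformers, which receives the metrics of the evaluation that just finished.

```python
import json
from pathlib import Path

try:
    from transformers import TrainerCallback
except ImportError:
    # Fallback so the sketch can be exercised without transformers installed.
    TrainerCallback = object


class SaveEvalMetricsCallback(TrainerCallback):
    """Hypothetical callback: append each evaluation's metrics to a JSON Lines file."""

    def __init__(self, output_path="eval_metrics.jsonl"):
        # `output_path` is an illustrative parameter, not part of any library API.
        self.output_path = Path(output_path)

    def on_evaluate(self, args, state, control, metrics=None, **kwargs):
        # Called after every evaluation, not only the final one.
        if not metrics:
            return
        # In multi-process runs, write from the main process only.
        if not getattr(state, "is_world_process_zero", True):
            return
        # Tag the metrics with the training step at which they were computed.
        record = {"step": state.global_step, **metrics}
        self.output_path.parent.mkdir(parents=True, exist_ok=True)
        with self.output_path.open("a") as f:
            f.write(json.dumps(record) + "\n")
```

Under these assumptions, the callback would be passed to the trainer through its `callbacks` argument, for example `callbacks=[SaveEvalMetricsCallback("out/eval_metrics.jsonl")]`, with periodic evaluation enabled in the training arguments. Appending one line per evaluation keeps earlier results intact if training is interrupted.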

Related Issue:
Closes #2602

This PR adds a helpful comment in the DPO script explaining how to log and save evaluation metrics during regular evaluations using a custom callback. It also clarifies W&B behavior regarding metric aggregation. Related to issue huggingface#2602.

Checklist:
- [x] Added clear example for custom callback
- [x] Clarified W&B aggregation behavior
- [x] No code logic changed, only documentation tip

cam1llynha · 2025-10-29T12:58:13Z

Hi team! This PR addresses the question raised in issue #2602 by adding a clear tip on how to log evaluation metrics during regular evaluations using a custom callback.
Please let me know if any adjustments are needed. Thanks for reviewing!

qgallouedec · 2025-10-29T14:36:41Z

Thanks @cam1llynha. In my opinion, this belongs in the transformers documentation, as it's common to all trainers.
