feat(reward): add language consistency reward function #683

izlley · 2025-06-26T06:56:46Z

Description

This pull request adds a new reward function, the lang_consistency reward, which improves the linguistic consistency of model outputs during reinforcement learning (e.g., GRPO).

Motivation

In both multilingual and monolingual settings, a model should stay in the target language specified by the prompt or conversation. This reward function incentivizes that behavior directly: it assigns a positive reward (1.0) only when the language of the generated completion matches the expected language. This discourages language drift and makes model behavior more reliable and predictable.

Inspired by the "language consistency reward" concept from the DeepSeek-R1 paper, this function provides a valuable tool for fine-tuning models where strict language adherence is required.
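To illustrate the matching rule described above, here is a minimal sketch and not the code from this PR. The function name `lang_consistency_reward`, its signature, the 0.0 reward for a mismatch, and the toy `detect_script_language` helper are all assumptions made for the example. A real setup would likely swap in a proper language-identification library as the detector.

```python
from typing import Callable, List, Sequence


def detect_script_language(text: str) -> str:
    """Hypothetical stand-in detector: guesses 'zh' or 'en' from the dominant script."""
    cjk = 0
    latin = 0
    for ch in text:
        if "\u4e00" <= ch <= "\u9fff":  # CJK Unified Ideographs block
            cjk += 1
        elif ch.isascii() and ch.isalpha():
            latin += 1
    if cjk == 0 and latin == 0:
        return "unknown"
    return "zh" if cjk >= latin else "en"


def lang_consistency_reward(
    completions: Sequence[str],
    expected_langs: Sequence[str],
    detector: Callable[[str], str] = detect_script_language,
) -> List[float]:
    """Return 1.0 where the detected language equals the expected one.

    Assumption: a mismatch scores 0.0. The PR description only states that
    the positive reward (1.0) is given on a match.
    """
    if len(completions) != len(expected_langs):
        raise ValueError("completions and expected_langs must have the same length")
    return [
        1.0 if detector(text) == lang else 0.0
        for text, lang in zip(completions, expected_langs)
    ]


rewards = lang_consistency_reward(
    ["The answer is 42.", "答案是四十二。", "答案是 42"],
    ["en", "zh", "en"],
)
print(rewards)
```

In this sketch the third completion is scored 0.0 because it is written in Chinese while English was expected. Passing the detector as a parameter keeps the reward logic independent of whichever language-identification backend is used.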

feat(reward): add language consistency reward function

fb71fde
