Adding sampling parameters for vllm generation #3210
Conversation
@qgallouedec This PR is ready for review :)
Thanks @shaipranesh2! I've added a few comments
stop: list[str] = [],
stop_token_ids: list[int] = [],
bad_words: list[str] = [],
Using mutable default arguments can lead to unexpected behavior because the same list is shared across all function calls.
Maybe replace these defaults with None and, in the function:
stop = stop or []
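A minimal sketch of that pattern (the surrounding `generate` signature is assumed from the diff above, not taken verbatim from the PR):

```python
def generate(
    prompts: list[str],
    stop: list[str] | None = None,
    stop_token_ids: list[int] | None = None,
    bad_words: list[str] | None = None,
):
    # Fall back to a fresh list on each call, so no mutable default
    # is shared between invocations.
    stop = stop or []
    stop_token_ids = stop_token_ids or []
    bad_words = bad_words or []
    ...
```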
"help": "Minimum length of the prompt. If the prompt is shorter than this value, it will be truncated left." | ||
}, |
How would truncation solve it? If the prompt is already shorter than this value, truncating it left would only make it shorter.
repetition_penalty: int = field(
    default=1,
    metadata={
        "help": "List of text prompts for which the model will generate completions."
I think there is a mismatch here: this help text describes prompts, not repetition_penalty.
For parameters specific to vllm, I would name them with a vllm_ prefix. Ideally, some are shared by transformers and vllm (like temperature); in that case, they don't need a prefix.
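As a sketch of the suggested naming, assuming dataclass-style config fields like those in the diff (the exact fields are illustrative):

```python
from dataclasses import dataclass, field

@dataclass
class GRPOConfig:
    # Shared by transformers and vllm generation: no prefix needed.
    temperature: float = field(
        default=1.0,
        metadata={"help": "Temperature used for sampling."},
    )
    # vllm-specific parameter: prefixed to make the backend explicit.
    vllm_repetition_penalty: float = field(
        default=1.0,
        metadata={"help": "Penalty applied to repeated tokens (vLLM sampling only)."},
    )
```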
What does this PR do?
Fixes issue #3201. Adds support in the GRPO config for setting additional vLLM sampling parameters.
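Roughly, the idea is to forward these config values to vLLM's SamplingParams. A hedged sketch of that wiring (the model and parameter values are illustrative, not the PR's exact code):

```python
from vllm import LLM, SamplingParams

# Illustrative values; in the PR these would come from the GRPO config.
prompts = ["The capital of France is"]
llm = LLM(model="Qwen/Qwen2.5-0.5B-Instruct")  # model choice is illustrative

sampling_params = SamplingParams(
    temperature=0.7,
    top_p=0.9,
    top_k=50,
    repetition_penalty=1.1,
    stop=["\n\n"],
    stop_token_ids=None,
)
outputs = llm.generate(prompts, sampling_params)
print(outputs[0].outputs[0].text)
```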