Pull requests: huggingface/trl

Updated OpenEnv docs
#4418 opened Oct 31, 2025 by sergiopaniego Draft
8 tasks
Create "Talks" subsection
#4414 opened Oct 31, 2025 by sergiopaniego
5 tasks
Add On-Policy Distillation from thinking labs to paper index.
#4410 opened Oct 30, 2025 by pramodith
4 of 5 tasks
Gold refactor
#4373 opened Oct 29, 2025 by qgallouedec Draft
5 tasks
Openenv wordle example
#4357 opened Oct 28, 2025 by burtenshaw
[OpenENV] Openenv rollout_func signature proposal
#4344 opened Oct 27, 2025 by kashif
5 tasks
wip - env
#4320 opened Oct 22, 2025 by qgallouedec
5 tasks
refactor: simplify parameter freezing in modeling_base.py
#4305 opened Oct 20, 2025 by Ki-Seki
2 of 5 tasks
GRPO: ScaleRL -> Support casting LM Head to FP32
#4303 opened Oct 18, 2025 by pramodith
4 of 5 tasks
[SFT] Log mean token accuracy from Liger kernel
#4302 opened Oct 18, 2025 by kashif
5 tasks
Tool call
#4300 opened Oct 18, 2025 by qgallouedec Draft
5 tasks
Add CISPO loss option and documentation
#4298 opened Oct 16, 2025 by gustavorubim
Fix DPO Trainer Bug For Qwen2-VL (Issue 2660)
#4257 opened Oct 11, 2025 by FabianSchuetze
1 of 3 tasks
Online-dpo-ben
#4252 opened Oct 10, 2025 by burtenshaw Draft
5 tasks
Add support for Python 3.14
#4225 opened Oct 8, 2025 by albertvillanova
Update max_length explanation for VLM trainers
#4220 opened Oct 7, 2025 by sergiopaniego
5 tasks
Add trust_remote_code to GRPOConfig
#4186 opened Oct 1, 2025 by muupan
3 of 4 tasks
🐍 Drop Python 3.9
#4183 opened Sep 30, 2025 by qgallouedec
feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer
#4091 opened Sep 15, 2025 by ycma8
2 of 5 tasks
Add config_init_kwargs option in GRPOConfig
#4069 opened Sep 12, 2025 by hokuyama0106
2 of 5 tasks