chore(python-deps): bump trl from 1.4.0 to 1.5.0#5964
Open
dependabot[bot] wants to merge 2 commits into
Open
Conversation
Bumps [trl](https://github.com/huggingface/trl) from 1.4.0 to 1.5.0. - [Release notes](https://github.com/huggingface/trl/releases) - [Changelog](https://github.com/huggingface/trl/blob/main/RELEASE.md) - [Commits](huggingface/trl@v1.4.0...v1.5.0) --- updated-dependencies: - dependency-name: trl dependency-version: 1.5.0 dependency-type: direct:development update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>
Contributor
Author
LabelsThe following labels could not be found: Please fix the above issues or remove invalid values from |
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Contributor
|
✅ Constraint-dependencies updated Updated |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Bumps trl from 1.4.0 to 1.5.0.
Release notes
Sourced from trl's releases.
... (truncated)
Commits
bd1e73fRelease: v1.5 (#5835)fb9cb79Add Qwen3.5 Think/NoThink training chat templates with generation markers (#5...9e80cabFixOpenRewardSpecomitting task‑scoped tools during rollout binding (fixes...7877695Migrate tests to Qwen3.5 Think/NoThink fixtures (#5821)0fcc5e2Add tiny Qwen3.5 Think/NoThink fixture generation scripts (#5819)43bd8f5Align KTO with DPO: Align _compute_loss_liger flow (#5816)cc4a0ffAlign and simplify the stable training scripts (#5812)4711a21Fixmetric_for_best_modelfor trainer-specific eval metrics (#5811)909d090Fix generate_batch: inference tensors block inplace ops in background thread ...d0e8b8cAlign KTO with DPO: Align compute_loss flow (#5810)Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting
@dependabot rebase.Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
@dependabot rebasewill rebase this PR@dependabot recreatewill recreate this PR, overwriting any edits that have been made to it@dependabot show <dependency name> ignore conditionswill show all of the ignore conditions of the specified dependency@dependabot ignore this major versionwill close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)@dependabot ignore this minor versionwill close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)@dependabot ignore this dependencywill close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)