Bug fix: logits-based training crash #26

Open
wants to merge 1 commit into
base: main
from

Conversation

hunter2009pf

  1. Remove two parameters that do not exist in the `SFTTrainer` initializer of the newest version of the `trl` package.
  2. Add the `num_items_in_batch` parameter to the `common_loss` function of class `LogitsTrainer` so that the training process runs to completion.
  3. Fix `ValueError`: make sure to call `tokenizer.padding_side = 'left'` before tokenizing the input.
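
For reviewers, the three changes can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in this PR: it does not import `trl`, `BaseTrainer` is a hypothetical stand-in for the real base class, `filter_supported_kwargs` and `prepare_tokenizer` are hypothetical helpers, and the loss hook is named `compute_loss` here on the assumption that this is the method item 2 refers to.

```python
import inspect


def filter_supported_kwargs(cls, kwargs):
    """Keep only keyword arguments that cls.__init__ explicitly accepts (hypothetical helper)."""
    params = inspect.signature(cls.__init__).parameters
    # If the initializer takes **kwargs, nothing can be ruled out by inspection.
    if any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()):
        return dict(kwargs)
    return {k: v for k, v in kwargs.items() if k in params}


class BaseTrainer:
    """Hypothetical stand-in for the trainer base class; not the trl API."""

    def __init__(self, model, args=None):
        self.model = model
        self.args = args

    def training_step(self, inputs, num_items_in_batch=None):
        # Assumption: the training loop forwards num_items_in_batch to the loss hook.
        return self.compute_loss(
            self.model, inputs, num_items_in_batch=num_items_in_batch
        )


class LogitsTrainer(BaseTrainer):
    def compute_loss(self, model, inputs, return_outputs=False, num_items_in_batch=None):
        # Accepting num_items_in_batch keeps the override compatible with a
        # caller that passes it; this sketch does not use the value.
        loss = model(inputs)
        return (loss, None) if return_outputs else loss


def prepare_tokenizer(tokenizer):
    """Set left padding before any tokenization happens (hypothetical helper)."""
    tokenizer.padding_side = "left"
    return tokenizer
```

With these definitions, `filter_supported_kwargs(BaseTrainer, {...})` drops any key the initializer does not declare, which is one way to avoid passing arguments that a newer initializer rejects. The PR itself removes the two parameters directly.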

1. remove two parameters that do not exist in the SFTTrainer initializer when using the newest version of trl package;
2. fix ValueError: make sure to call `tokenizer.padding_side = 'left'` before tokenizing the input.