New base trainer #4

sunildkumar · 2025-02-12T19:23:53Z

This works well.

…er token logps. Current implemention does not support flash attn yet . - Update `_get_per_token_logps` method to support vision models with pixel values and image grid - Modify generation and loss computation to handle vision-specific inputs - Refactor code to improve readability and support multi-modal models - Add pixel values and image grid parameters to various method signatures

sunildkumar added 6 commits February 12, 2025 10:11

setup file structure

1220d35

made it through trainer init

8683502

was able to train on formatting

60530aa

pushing for leo

b2397e1

ruff

2823594

sunildkumar merged commit a5414d7 into main Aug 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

New base trainer #4

New base trainer #4

Uh oh!

sunildkumar commented Feb 12, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

New base trainer #4

New base trainer #4

Uh oh!

Conversation

sunildkumar commented Feb 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

sunildkumar commented Feb 12, 2025 •

edited

Loading