Skip to content

Conversation

@sunildkumar
Copy link
Member

@sunildkumar sunildkumar commented Feb 12, 2025

This works well.

…er token logps. Current implemention does not support flash attn yet .

- Update `_get_per_token_logps` method to support vision models with pixel values and image grid
- Modify generation and loss computation to handle vision-specific inputs
- Refactor code to improve readability and support multi-modal models
- Add pixel values and image grid parameters to various method signatures
@sunildkumar sunildkumar merged commit a5414d7 into main Aug 1, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant