Skip to content

Conversation

yuankuns
Copy link

@yuankuns yuankuns commented Oct 3, 2025

  1. Enable flash attention 2 backward
  2. layout bhsd/bshd
  3. dtype fp16/bf16
  4. is_causal WIP,
  5. GQA WIP
  6. NUM_HEAD=1 WIP

@Antonyvance Antonyvance added the redesign required Implementation require a redesign label Oct 17, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

redesign required Implementation require a redesign

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants