axrshz/gpt-oss

minimal gpt-oss implementation in PyTorch.

Important

this repo is only for learning purposes.

components:

  • RMSNorm (pre-norm)
  • Grouped Query Attention with attention sinks and sliding window
  • RoPE with YaRN
  • Mixture-of-Experts (MoE) with gated router
  • SwiGLU and KV-cache
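
To make a few of the components above concrete, here is a minimal PyTorch sketch of RMSNorm, a causal sliding-window mask, a softmax with an attention-sink logit, and a top-k gated router. This is not the code from this repo: the class and function names (`RMSNorm`, `sliding_window_mask`, `softmax_with_sink`, `TopKRouter`), the tensor shapes, and the default `eps` are all assumptions made for illustration, and the router's choice of applying softmax over only the selected logits is one common variant rather than a confirmed detail of this implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class RMSNorm(nn.Module):
    """Rescale each vector by the reciprocal of its root-mean-square."""

    def __init__(self, dim: int, eps: float = 1e-5):  # eps value is an assumption
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        inv_rms = torch.rsqrt(x.pow(2).mean(dim=-1, keepdim=True) + self.eps)
        return x * inv_rms * self.weight


def sliding_window_mask(seq_len: int, window: int) -> torch.Tensor:
    """Boolean (seq_len, seq_len) mask: True where query i may attend to key j.

    Causal, and limited to the most recent `window` keys (including i itself).
    """
    i = torch.arange(seq_len).unsqueeze(1)
    j = torch.arange(seq_len).unsqueeze(0)
    return (j <= i) & (j > i - window)


def softmax_with_sink(scores: torch.Tensor, sink: torch.Tensor) -> torch.Tensor:
    """Softmax over keys plus one extra sink logit per head.

    scores: (heads, q_len, k_len), sink: (heads,).
    The sink column absorbs probability mass and is then dropped,
    so each returned row sums to at most 1.
    """
    sink_col = sink.view(-1, 1, 1).expand(-1, scores.shape[1], 1)
    probs = F.softmax(torch.cat([scores, sink_col], dim=-1), dim=-1)
    return probs[..., :-1]


class TopKRouter(nn.Module):
    """Pick top_k experts per token and normalise their gate weights."""

    def __init__(self, dim: int, num_experts: int, top_k: int):
        super().__init__()
        self.top_k = top_k
        self.gate = nn.Linear(dim, num_experts)

    def forward(self, x: torch.Tensor):
        logits = self.gate(x)                              # (..., num_experts)
        top_vals, top_idx = logits.topk(self.top_k, dim=-1)
        weights = F.softmax(top_vals, dim=-1)              # softmax over selected experts only
        return weights, top_idx
```

In an attention layer, the mask would typically be applied with `scores.masked_fill(~mask, float("-inf"))` before calling `softmax_with_sink`, and the router's `weights` would be used to combine the outputs of the selected experts.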

reference: these banger repos by Vizuara and Hamza Elshafie
