    Repositories list

    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.8k100 Updated Feb 22, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      28k200 Updated Feb 18, 2025
    • Zonos

      Public
      Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers.
      Python
      Apache License 2.0
      5275.4k9013 Updated Feb 18, 2025
    • Python
      Apache License 2.0
      14570 Updated Feb 5, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      28k300 Updated Feb 4, 2025
    • Zamba2

      Public
      PyTorch implementation of models from the Zamba2 series.
      Python
      Apache License 2.0
      1617631 Updated Jan 23, 2025
    • zcookbook

      Public
      Training hybrid models for dummies.
      Python
      Apache License 2.0
      22001 Updated Jan 16, 2025
    • Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
      Python
      511610 Updated Dec 3, 2024
    • FastChat

      Public
      An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
      Python
      Apache License 2.0
      4.6k000 Updated Nov 6, 2024
    • Ongoing research training transformer models at scale
      Python
      Other
      2.6k0104 Updated Aug 20, 2024
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      Other
      2.6k002 Updated Aug 19, 2024
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.5k000 Updated Jul 8, 2024
    • Python
      Apache License 2.0
      1700 Updated Jul 1, 2024
    • mamba

      Public
      Python
      Apache License 2.0
      1.2k400 Updated Jun 27, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2.1k000 Updated Jun 20, 2024
    • Python
      Apache License 2.0
      13110 Updated Jun 19, 2024
    • High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
      C++
      MIT License
      424100 Updated Jun 11, 2024
    • Dataset for the temporal memory tests
      0500 Updated Jun 4, 2024
    • Robust recipes to align language models with human and AI preferences
      Python
      Apache License 2.0
      433000 Updated Jun 3, 2024
    • Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
      Python
      Apache License 2.0
      200000 Updated Mar 8, 2024
    • Code repository for Black Mamba
      Python
      1823750 Updated Feb 8, 2024
    • 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
      Rust
      Apache License 2.0
      846101 Updated Feb 3, 2024
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      Apache License 2.0
      4.3k000 Updated Nov 2, 2023
    • apex

      Public
      A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k000 Updated Nov 1, 2023