Skip to content
Change the repository type filter

All

    Repositories list

    • A package for sampling from Gibbs distributions during inference with LLMs.
      Python
      Apache License 2.0
      1810Updated Apr 11, 2025Apr 11, 2025
    • lmms-eval

      Public
      Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
      Python
      Other
      248000Updated Apr 11, 2025Apr 11, 2025
    • zsb

      Public
      Python
      0100Updated Apr 10, 2025Apr 10, 2025
    • treqa

      Public
      LLM-based QAG framework for MT Evaluation
      0010Updated Apr 9, 2025Apr 9, 2025
    • A fork of lm-eval-harness.
      Python
      MIT License
      2.3k000Updated Apr 3, 2025Apr 3, 2025
    • Ongoing research training transformer models at scale
      Python
      Other
      2.7k101Updated Apr 3, 2025Apr 3, 2025
    • Python
      42900Updated Apr 3, 2025Apr 3, 2025
    • A PyTorch native library for large model training
      Python
      BSD 3-Clause "New" or "Revised" License
      332000Updated Apr 1, 2025Apr 1, 2025
    • adasplash

      Public
      AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)
      Python
      MIT License
      0100Updated Mar 22, 2025Mar 22, 2025
    • fy-vi

      Public
      Jupyter Notebook
      0000Updated Mar 21, 2025Mar 21, 2025
    • doce

      Public
      This is the a repo of DOCE
      Python
      Apache License 2.0
      0200Updated Mar 14, 2025Mar 14, 2025
    • Repository containing code to reproduce results of the paper "Sparse Activations as Conformal Predictors".
      Jupyter Notebook
      0100Updated Feb 25, 2025Feb 25, 2025
    • latim

      Public
      Jupyter Notebook
      MIT License
      0400Updated Feb 24, 2025Feb 24, 2025
    • CHM-Net

      Public
      Modern Hopfield Networks with Continuous-Time Memories
      Python
      MIT License
      0000Updated Feb 21, 2025Feb 21, 2025
    • 0000Updated Feb 17, 2025Feb 17, 2025
    • \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation
      Python
      MIT License
      01300Updated Feb 14, 2025Feb 14, 2025
    • ssm-mt

      Public
      Jupyter Notebook
      0100Updated Feb 8, 2025Feb 8, 2025
    • Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
      Python
      Apache License 2.0
      322000Updated Feb 4, 2025Feb 4, 2025
    • HFYN

      Public
      Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval
      Jupyter Notebook
      MIT License
      0100Updated Jan 31, 2025Jan 31, 2025
    • Jupyter Notebook
      1200Updated Oct 15, 2024Oct 15, 2024
    • Python
      0200Updated Oct 10, 2024Oct 10, 2024
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      Apache License 2.0
      989000Updated Sep 26, 2024Sep 26, 2024
    • nanotron

      Public
      Minimalistic large language model 3D-parallelism training
      Python
      Apache License 2.0
      176000Updated Sep 19, 2024Sep 19, 2024
    • Python
      76427Updated Aug 29, 2024Aug 29, 2024
    • DeepSPIN's submission to SIGMORPHON 2020
      Python
      MIT License
      1511Updated Jul 25, 2024Jul 25, 2024
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      Other
      2.7k102Updated Jul 12, 2024Jul 12, 2024
    • SSHN

      Public
      Sparse and Structured Hopfield Networks
      Python
      MIT License
      0300Updated Jul 4, 2024Jul 4, 2024
    • entmax

      Public
      The entmax mapping and its loss, a family of sparse softmax alternatives.
      Python
      MIT License
      46430122Updated Jun 22, 2024Jun 22, 2024
    • COMET

      Public
      A Neural Framework for MT Evaluation
      Python
      Apache License 2.0
      89000Updated Jun 11, 2024Jun 11, 2024
    • robust-mt

      Public
      0000Updated Mar 6, 2024Mar 6, 2024