Skip to content
Change the repository type filter

All

    Repositories list

    • Baselines for Neural MMO -- new users should treat this repo as a starter project
      Python
      MIT License
      41501Updated Nov 26, 2024Nov 26, 2024
    • Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research
      Python
      MIT License
      2661500Updated May 30, 2024May 30, 2024
    • DRLX

      Public
      Diffusion Reinforcement Learning Library
      Python
      MIT License
      817781Updated Feb 13, 2024Feb 13, 2024
    • trlx

      Public
      A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
      Python
      MIT License
      4734.6k8115Updated Jan 8, 2024Jan 8, 2024
    • autocrit

      Public
      A repository for transformer critique learning and generation
      Python
      188833Updated Dec 7, 2023Dec 7, 2023
    • OpenELM

      Public
      Evolution Through Large Models
      Python
      MIT License
      8671061Updated Nov 15, 2023Nov 15, 2023
    • Jupyter Notebook
      Apache License 2.0
      62201Updated Aug 27, 2023Aug 27, 2023
    • Jupyter Notebook
      1100Updated Aug 10, 2023Aug 10, 2023
    • magiCARP is an API used for crossencoder training.
      Python
      41040Updated Jul 27, 2023Jul 27, 2023
    • tinypar

      Public
      Python
      9000Updated Jul 16, 2023Jul 16, 2023
    • Polygraph

      Public
      RLHF Mechanistic Interpretability and Deception
      MIT License
      2600Updated Jul 14, 2023Jul 14, 2023
    • squeakily

      Public
      A library for squeakily cleaning and filtering language datasets.
      Jupyter Notebook
      Apache License 2.0
      104522Updated Jul 10, 2023Jul 10, 2023
    • maxtext

      Public
      A simple, performant and scalable Jax LLM!
      Python
      Apache License 2.0
      309100Updated Jun 30, 2023Jun 30, 2023
    • sft

      Public
      Python
      4201Updated Jun 29, 2023Jun 29, 2023
    • Python
      MIT License
      5800Updated Jun 21, 2023Jun 21, 2023
    • FastChat

      Public
      An open platform for training, serving, and evaluating large language model based chatbots.
      Python
      Apache License 2.0
      4.6k400Updated Apr 26, 2023Apr 26, 2023
    • This repository contains code for cleaning your training data of benchmark data to help combat data snooping.
      Jupyter Notebook
      Apache License 2.0
      42500Updated Apr 21, 2023Apr 21, 2023
    • pilev2

      Public
      Python
      MIT License
      101113Updated Mar 24, 2023Mar 24, 2023
    • goosebox

      Public
      sandboxed eval server for running code snippets
      MIT License
      2100Updated Mar 1, 2023Mar 1, 2023
    • cheese

      Public
      Used for adaptive human in the loop evaluation of language and embedding models.
      Python
      MIT License
      2530630Updated Mar 1, 2023Mar 1, 2023
    • Code-Pile

      Public
      This repository contains all the code for collecting large scale amounts of code from GitHub.
      Python
      MIT License
      30105184Updated Feb 17, 2023Feb 17, 2023
    • Python
      MIT License
      63413Updated Jan 29, 2023Jan 29, 2023
    • Stuff related to scraping the Code Review StackExchange
      Python
      61101Updated Jan 19, 2023Jan 19, 2023
    • For experiments involving instruct gpt. Currently used for documenting open research questions.
      MIT License
      471250Updated Nov 8, 2022Nov 8, 2022
    • Code used for sourcing and cleaning the BigScience ROOTS corpus
      E
      Apache License 2.0
      40400Updated Nov 6, 2022Nov 6, 2022
    • 👀
      MIT License
      1320Updated Oct 7, 2022Oct 7, 2022
    • Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
      Python
      Other
      60200Updated Jul 28, 2022Jul 28, 2022