Skip to content
Change the repository type filter

All

    Repositories list

    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      1.7k6.6k28791Updated Oct 5, 2024Oct 5, 2024
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      Apache License 2.0
      9966.9k5521Updated Oct 3, 2024Oct 3, 2024
    • Adds GaLore style projection wrappers to optax optimizers
      Python
      MIT License
      0300Updated Oct 3, 2024Oct 3, 2024
    • Equinox implementation of llama3 and llama3.1
      Python
      MIT License
      0400Updated Oct 3, 2024Oct 3, 2024
    • Python
      Apache License 2.0
      76400Updated Oct 3, 2024Oct 3, 2024
    • Jupyter Notebook
      MIT License
      0100Updated Oct 3, 2024Oct 3, 2024
    • cupbearer

      Public
      A library for mechanistic anomaly detection
      Python
      MIT License
      9500Updated Oct 3, 2024Oct 3, 2024
    • Efficiently computing & storing token n-grams from large corpora
      Rust
      MIT License
      31500Updated Oct 2, 2024Oct 2, 2024
    • sgdensity

      Public
      Computing the implicit probability densities that SGD assigns to networks
      Python
      Apache License 2.0
      0000Updated Oct 1, 2024Oct 1, 2024
    • Jupyter Notebook
      54405Updated Oct 1, 2024Oct 1, 2024
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      MIT License
      331821410Updated Sep 30, 2024Sep 30, 2024
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      6402Updated Sep 30, 2024Sep 30, 2024
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      Apache License 2.0
      71900Updated Sep 29, 2024Sep 29, 2024
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      Apache License 2.0
      3468080Updated Sep 24, 2024Sep 24, 2024
    • w2s

      Public
      Python
      MIT License
      01510Updated Sep 24, 2024Sep 24, 2024
    • ccs

      Public
      Python
      MIT License
      6433Updated Sep 24, 2024Sep 24, 2024
    • The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      MIT License
      5.8k6500Updated Sep 20, 2024Sep 20, 2024
    • monkfish

      Public
      Python
      MIT License
      1400Updated Sep 18, 2024Sep 18, 2024
    • Understanding how features learned by neural networks evolve throughout training
      Python
      MIT License
      13001Updated Sep 16, 2024Sep 16, 2024
    • A library for efficient patching and automatic circuit discovery.
      Python
      9000Updated Sep 16, 2024Sep 16, 2024
    • sae

      Public
      Sparse autoencoders
      Python
      MIT License
      4031231Updated Sep 9, 2024Sep 9, 2024
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      Apache License 2.0
      1632.2k234Updated Aug 21, 2024Aug 21, 2024
    • Python
      0200Updated Aug 2, 2024Aug 2, 2024
    • Utilities to use the Hugging Face Hub API
      TypeScript
      MIT License
      213100Updated Jul 31, 2024Jul 31, 2024
    • aria

      Public
      Python
      Apache License 2.0
      113900Updated Jul 18, 2024Jul 18, 2024
    • Python
      0000Updated Jul 3, 2024Jul 3, 2024
    • Script for downloading GitHub.
      Python
      418705Updated Jul 1, 2024Jul 1, 2024
    • CAA

      Public
      Steering Llama 2 with Contrastive Activation Addition
      Jupyter Notebook
      MIT License
      29000Updated Jun 30, 2024Jun 30, 2024
    • Experiments in transformer knowledge and reasoning
      Python
      MIT License
      12000Updated Jun 21, 2024Jun 21, 2024
    • Engineering the state of RNN language models (Mamba, RWKV, etc.)
      Jupyter Notebook
      MIT License
      23100Updated May 25, 2024May 25, 2024