EleutherAI ORG

@EleutherAI ·The Internet ·www.eleuther.ai

182 repos
4.3k followers
0 following

Python 72%
Jupyter Notebook 15%
JavaScript 5%
HTML 5%
Cuda 1%

All public repos (182)

Show forks Show archived

lm-evaluation-harness

A framework for few-shot evaluation of language models.

The LM Evaluation Harness is a Python framework for benchmarking AI language models on 60+ standardized tasks in a reproducible way, it powers the Hugging Face Open LLM Leaderboard.

Python ★ 13k 18d ago
Explain →
gpt-neo ▣

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Python ★ 8.3k 4y ago
Explain →
gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

GPT-NeoX is a Python toolkit for training billion-parameter AI language models from scratch on GPU clusters, not for chatting with existing models, but for organizations building new ones at research scale.

Python ★ 7.4k 9d ago
Explain →
pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook ★ 2.8k 7mo ago
Explain →
the-pile

No description.

Python ★ 1.7k 3y ago
Explain →
math-lm

No description.

Python ★ 1.1k 2y ago
Explain →
cookbook

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python ★ 845 3mo ago
Explain →
sparsify

Sparsify transformers with SAEs and transcoders

Python ★ 727 5d ago
Explain →
polyglot

Polyglot: Large Language Models of Well-balanced Competence in Multi-languages

★ 487 2y ago
Explain →
DALLE-mtf

Open-AI's DALL-E for large scale training in mesh-tensorflow.

Python ★ 431 4y ago
Explain →
vqgan-clip

No description.

Jupyter Notebook ★ 353 4y ago
Explain →
delphi

Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.

Python ★ 262 5d ago
Explain →
concept-erasure

Erasing concepts from neural representations with provable guarantees

Python ★ 255 1y ago
Explain →
elk

Keeping language models honest by directly eliciting knowledge encoded in their activations.

Python ★ 220 5d ago
Explain →
nanoGPT-mup ⑂

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python ★ 196 5mo ago
Explain →
oslo

OSLO: Open Source for Large-scale Optimization

Python ★ 175 2y ago
Explain →
DeeperSpeed ⑂

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Python ★ 173 8mo ago
Explain →
lm_perplexity

No description.

Python ★ 164 5y ago
Explain →
knowledge-neurons

A library for finding knowledge neurons in pretrained transformer models.

Python ★ 160 4y ago
Explain →
pyfra

Python Research Framework

Python ★ 107 3y ago
Explain →
aria

Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)

Python ★ 105 1mo ago
Explain →
github-downloader ⑂

Script for downloading GitHub.

Python ★ 99 1y ago
Explain →
openwebtext2

No description.

Python ★ 95 3y ago
Explain →
dps

Data processing system for polyglot

Python ★ 93 2y ago
Explain →
stackexchange-dataset

Python tools for processing the stackexchange data dumps into a text dataset for Language Models

Python ★ 87 2y ago
Explain →
improved-t5

Experiments for efforts to train a new and improved t5

Python ★ 76 2y ago
Explain →
minetest ⑂

Minetest is an open source voxel game engine with easy modding and game creation

C++ ★ 75 2y ago
Explain →
aria-amt

Efficient and robust implementation of seq-to-seq automatic piano transcription.

Python ★ 70 6mo ago
Explain →
project-menu

See the issue board for the current status of active and prospective projects!

★ 65 4y ago
Explain →
bergson

Mapping out the "memory" of neural nets with data attribution

Python ★ 61 4d ago
Explain →
magiCARP

One stop shop for all things carp

Python ★ 58 3y ago
Explain →
semantic-memorization

No description.

Jupyter Notebook ★ 44 1y ago
Explain →
tqdm-multiprocess

Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly through the main process. It offers similar functionality for python logging.

Python ★ 43 5y ago
Explain →
features-across-time

Understanding how features learned by neural networks evolve throughout training

Python ★ 41 1y ago
Explain →
mp_nerf

Massively-Parallel Natural Extension of Reference Frame

Jupyter Notebook ★ 34 3y ago
Explain →
hae-rae

No description.

★ 33 2y ago
Explain →
rnngineering

Engineering the state of RNN language models (Mamba, RWKV, etc.)

Jupyter Notebook ★ 32 2y ago
Explain →
elk-generalization

Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard

Python ★ 31 2y ago
Explain →
steering-llama3

No description.

Python ★ 30 1y ago
Explain →
tokengrams

Efficiently computing & storing token n-grams from large corpora

Rust ★ 27 6d ago
Explain →
clt-training ⑂

Sparsify transformers with cross-layer transcoders

Python ★ 26 7mo ago
Explain →
pile-pubmedcentral

A script for collecting the PubMed Central dataset in a language modelling friendly format.

Python ★ 26 5y ago
Explain →
training-jacobian

No description.

Jupyter Notebook ★ 24 1y ago
Explain →
w2s

No description.

Python ★ 24 1y ago
Explain →
mdl

Minimum Description Length probing for neural network representations

Python ★ 20 1y ago
Explain →
deep-ignorance

No description.

Python ★ 19 5mo ago
Explain →
polyglot-data

data related codebase for polyglot project

Python ★ 19 3y ago
Explain →
BIG-bench ⑂

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python ★ 19 4y ago
Explain →
best-download

URL downloader supporting checkpointing and continuous checksumming.

Python ★ 19 2y ago
Explain →
pile_dedupe

Pile Deduplication Code

Python ★ 18 3y ago
Explain →
attribute ⑂

No description.

Python ★ 16 7mo ago
Explain →
latent-video-diffusion

Latent video diffusion

Python ★ 16 2y ago
Explain →
NeMo ⑂

NeMo: a toolkit for conversational AI

Python ★ 16 3y ago
Explain →
pile-cc ⑂

No description.

★ 16 4y ago
Explain →
text-generation-testing-ui

Web app for demoing the EAI models

JavaScript ★ 16 4y ago
Explain →
exploring-contrastive-topology

No description.

Jupyter Notebook ★ 15 4y ago
Explain →
polyapprox

Closed-form polynomial approximations to neural networks

Python ★ 13 1y ago
Explain →
pilev2

No description.

Python ★ 13 3y ago
Explain →
pile-literotica

Download, parse, and filter data from Literotica. Data-ready for The-Pile.

Python ★ 12 5y ago
Explain →
datasets ⑂

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

★ 12 5y ago
Explain →
radioactive-lab

Adapting the "Radioactive Data" paper to work for text models

Python ★ 12 5y ago
Explain →
lm_dataformat ⑂

No description.

Python ★ 11 2y ago
Explain →
transformer-reasoning ⑂

Experiments in transformer knowledge and reasoning

Jupyter Notebook ★ 10 1y ago
Explain →
architecture-objective ⑂

No description.

Python ★ 10 3y ago
Explain →
equivariance

A framework for implementing equivariant DL

Jupyter Notebook ★ 10 5y ago
Explain →
djinn

Generating, validating and running exploitable verifiable coding problems

Python ★ 10 5mo ago
Explain →
hn-scraper

No description.

Python ★ 9 5y ago
Explain →
llemma-sample-explorer

Sample explorer tool for the Llemma models.

HTML ★ 9 2y ago
Explain →
attention-probes

Linear probes with attention weighting

Python ★ 8 10mo ago
Explain →
equinox-llama

Equinox implementation of llama3 and llama3.1

Python ★ 8 1y ago
Explain →
GPTeacher ⑂

A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer

★ 8 3y ago
Explain →
minetest-baselines

Baseline agents for Minetest tasks.

Python ★ 8 2y ago
Explain →
website

New website for EleutherAI based on Hugo static site generator

HTML ★ 8 3d ago
Explain →
pile-uspto

A script for collecting the USPTO Backgrounds dataset in a language modelling friendly format.

Python ★ 8 5y ago
Explain →
tyche

Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors

Jupyter Notebook ★ 8 1y ago
Explain →
tagged-pile

Part-of-Speech Tagging for the Pile and RedPajama

Python ★ 8 3y ago
Explain →
ccs

No description.

Python ★ 7 1y ago
Explain →
multimodal-fid

No description.

Python ★ 7 3y ago
Explain →
aria-utils

MIDI tokenizers and pre-processing utils.

Python ★ 6 5d ago
Explain →
cupbearer ⑂

A library for mechanistic anomaly detection

Jupyter Notebook ★ 6 1y ago
Explain →
weak-to-strong ⑂

No description.

Python ★ 6 2y ago
Explain →
trlx ⑂

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python ★ 6 2y ago
Explain →
minetest-interpretabilty-notebook

Jupyter notebook for the interpretablity section of the minetester blog post

Jupyter Notebook ★ 6 2y ago
Explain →
CodeCARP

Data collection pipeline for CodeCARP. Includes PyCharm plugins.

★ 6 4y ago
Explain →
pile-enron-emails

A script for collecting the Enron Emails dataset in a language modelling friendly format.

Python ★ 6 5y ago
Explain →
LLM-Markov-Chains

Project github for LLM Markov Chains Project

★ 6 3y ago
Explain →
pile-cc-filtering

The code used to filter CC data for The Pile

Python ★ 6 5y ago
Explain →
clearnets

No description.

Python ★ 5 1y ago
Explain →
optax-galore

Adds GaLore style projection wrappers to optax optimizers

Python ★ 5 1y ago
Explain →
architecture-experiments

Repository to host architecture experiments and development using Paxml and Praxis

Python ★ 5 3y ago
Explain →
FLAN ⑂

No description.

Python ★ 5 3y ago
Explain →
thonkenizers

yes

★ 5 4y ago
Explain →
visual-grounding

Visually ground GPT-Neo 1.3b and 2.7b

Python ★ 5 5y ago
Explain →
eleutherai.github.io

This is the Hugo generated website for eleuther.ai. The source of this build is new-website repo.

HTML ★ 5 5y ago
Explain →
pile-explorer

For exploring the data and documenting its limitations

Python ★ 5 5y ago
Explain →
open-r1 ⑂

Fully open reproduction of DeepSeek-R1

Python ★ 5 1y ago
Explain →
scalable-elicitation ⑂

The code used in "Balancing Label Quantity and Quality for Scalable Elicitation"

Jupyter Notebook ★ 4 1y ago
Explain →
monkfish ⑂

No description.

Python ★ 4 1y ago
Explain →
alignment-handbook ⑂

Robust recipes for to align language models with human and AI preferences

★ 4 2y ago
Explain →
Unpaired-Image-Generation

Project Repo for Unpaired Image Generation project

★ 4 2y ago
Explain →
lm-scope

No description.

Jupyter Notebook ★ 4 4y ago
Explain →
megatron-3d ▣

No description.

Python ★ 4 5y ago
Explain →
pile-website ⑂

No description.

HTML ★ 4 2y ago
Explain →
pile-ubuntu-irc

A script for collecting the Ubuntu IRC dataset in a language modelling friendly format.

Python ★ 4 5y ago
Explain →
emergent-misalignment ⑂

No description.

Python ★ 4 3d ago
Explain →
ngrams-across-time

No description.

Jupyter Notebook ★ 4 1y ago
Explain →
rtopk

https://github.com/xiexi51/RTopK PyTorch wrapper

Cuda ★ 3 1y ago
Explain →
sae_overlap

Acompanying code for our research on SAE feature overlap when trained on different seeds.

Jupyter Notebook ★ 3 1y ago
Explain →
variance-across-time

Studying the variance in neural net predictions across training time

Python ★ 3 2y ago
Explain →
EvilModel

A replication of "EvilModel 2.0: Bringing Neural Network Models into Malware Attacks"

★ 3 2y ago
Explain →
eai-prompt-gallery

Library of interesting prompt generations

JavaScript ★ 3 4y ago
Explain →
isaac-mchorse

EleutherAI's discord bot

Python ★ 3 5y ago
Explain →
pile-allpoetry

Scraper to gather poems from allpoetry.com

Python ★ 3 5y ago
Explain →
bucket-cleaner

A small utility to clear out old model checkpoints in Google Cloud Buckets whilst keeping tensorboard event files

Python ★ 3 5y ago
Explain →
unlearn

No description.

Python ★ 3 11d ago
Explain →
reddit-comment-processing

No description.

Python ★ 3 5y ago
Explain →
composer ⑂

Train neural networks up to 7x faster

Python ★ 3 3y ago
Explain →
gamescope

Can interpretability methods confer an advantage in competitive games?

Python ★ 2 6mo ago
Explain →
fmri

Analogue of fMRI on artificial neural networks

★ 2 1y ago
Explain →
pd-books

No description.

Jupyter Notebook ★ 2 2y ago
Explain →
tuned-lens ⑂

Tools for understanding how transformer predictions are built layer-by-layer

Python ★ 2 7mo ago
Explain →
tinydpo ⑂

No description.

★ 2 2y ago
Explain →
eleutherai-instruct-dataset

A large instruct dataset for open-source models (WIP).

★ 2 3y ago
Explain →
examples ⑂

Mosaicml example benchmarks + LLM scripts

Python ★ 2 3y ago
Explain →
minetest_game ⑂

Minetest Game - The default game for the Minetest engine [https://github.com/minetest/minetest/]

★ 2 3y ago
Explain →
groupoid-rl

No description.

Jupyter Notebook ★ 2 4y ago
Explain →
SkipTranscoderSAEBench

No description.

Python ★ 2 11mo ago
Explain →
POSER ⑂

Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals

Python ★ 2 1y ago
Explain →
auto-circuit ⑂

A library for efficient patching and automatic circuit discovery.

★ 2 1y ago
Explain →
huggingface.js ⑂

Utilities to use the Hugging Face Hub API

TypeScript ★ 2 1y ago
Explain →
truffaldino

Investigating goal instability in RL

Python ★ 1 1y ago
Explain →
rllm ⑂

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook ★ 1 1y ago
Explain →
bayesian-adam

Exactly what it says on the tin

Python ★ 1 2y ago
Explain →
RWKV-LM ⑂

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Python ★ 1 2y ago
Explain →
conceptual-constraints

Applying LEACE to models during training

Jupyter Notebook ★ 1 2y ago
Explain →
aria.cpp

GGML implementation of https://github.com/EleutherAI/aria

CMake ★ 1 2y ago
Explain →
classifier-latent-diffusion

No description.

Python ★ 1 2y ago
Explain →
language-adaptation

No description.

★ 1 2y ago
Explain →
maxtext ⑂

A simple, performant and scalable Jax LLM!

★ 1 3y ago
Explain →
irrlicht ⑂

Minetest's fork of Irrlicht

C++ ★ 1 2y ago
Explain →
lm-evaulation-ui

App for generating html table from LM evaluation JSONs

JavaScript ★ 1 4y ago
Explain →
poll_website_demo

Flask Based Polling Website Demo

Python ★ 1 5y ago
Explain →
discord-role-bot ⑂

Control Discord Roles with Reactions

★ 1 5y ago
Explain →
eleuther-blog

here is the generated content for the EleutherAI blog. Source is from new-website repo

HTML ★ 1 5y ago
Explain →
lang-filter

Filter text files or archives by language

Python ★ 1 5y ago
Explain →
pile-cord19

A script for collecting the CORD-19 dataset in a language modelling friendly format.

Python ★ 1 5y ago
Explain →
jusText ⑂

Heuristic based boilerplate removal tool

Python ★ 1 5y ago
Explain →
grouch

No description.

HTML ★ 1 5y ago
Explain →
circuit-breakers-SFT ⑂

Improving Alignment and Robustness with Circuit Breakers

★ 1 1y ago
Explain →
SAELens ⑂

Training Sparse Autoencoders on Language Models

★ 1 2y ago
Explain →
prefix-free-tokenizer

A prefix free tokenizer

Python ★ 1 2y ago
Explain →
common-llm-settings

Common LLM Settings App

JavaScript ★ 1 2y ago
Explain →
alignment-reader

Search and filter through alignment literature

JavaScript ★ 1 4y ago
Explain →
fractal-ml ⑂

Fun stuff with fractal machine learning

Jupyter Notebook ★ 1 5y ago
Explain →
cc_img_dl ⑂

No description.

Python ★ 1 4y ago
Explain →
gradient-routing ⑂

No description.

Python ★ 0 3mo ago
Explain →
rh-indicators

No description.

Python ★ 0 2mo ago
Explain →
hackable-bergson ⑂

Simplified library for mapping out the "memory" of neural nets with data attribution

★ 0 7mo ago
Explain →
vllm ⑂

A high-throughput and memory-efficient inference and serving engine for LLMs

★ 0 7mo ago
Explain →
verifiers ⑂

Verifiers for LLM Reinforcement Learning

Python ★ 0 10mo ago
Explain →
wmdp ⑂

WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.

Jupyter Notebook ★ 0 1y ago
Explain →
Megatron-LM ⑂

Ongoing research training transformer models at scale

★ 0 1y ago
Explain →
mixture-of-depths

No description.

★ 0 1y ago
Explain →
llm-score-behavior

No description.

Python ★ 0 1y ago
Explain →
TransformerEngine ⑂

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Python ★ 0 1y ago
Explain →
Plenoxels_FreeNerf ⑂

implmentation of Plenoxels radiance fields without neural networks, with free nerf strategy

★ 0 3y ago
Explain →
oslo-1 ⑂

OSLO: Open Source for Large-scale Optimization

★ 0 3y ago
Explain →
t-zero ⑂

Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)

★ 0 3y ago
Explain →
CommonLoopUtils ⑂

[WIP] a version of CLU with WandB logging added.

Jupyter Notebook ★ 0 3y ago
Explain →
pytorch-fid ⑂

Compute FID scores with PyTorch.

★ 0 4y ago
Explain →
visual-grounding-jax

Experiments pertaining to visually grounding Neo, built off of mesh-transformer-jax

★ 0 5y ago
Explain →
OpenInstructData

No description.

★ 0 5y ago
Explain →
depoison

Fixes poisoned directories in google cloud buckets

Python ★ 0 5y ago
Explain →
Garner-python ⑂

A library containing all you need to easily integrate with the Garner data crowdsourcing system

★ 0 5y ago
Explain →
pile-arxiv

No description.

Python ★ 0 5y ago
Explain →
lingvo ⑂

Lingvo

★ 0 5y ago
Explain →
djinn-problems

Problems generated by djinn (exploitably verifiable coding problems)

★ 0 5mo ago
Explain →
truncated-gaussian ▣

Method-of-moments estimation and sampling for truncated multivariate Gaussian distributions

Python ★ 0 2y ago
Explain →
CAA ⑂

Steering Llama 2 with Contrastive Activation Addition

★ 0 2y ago
Explain →
mup ⑂

maximal update parametrization (µP)

★ 0 2y ago
Explain →
gaia ⑂

Hugging Face and Pyserini interoperability

★ 0 3y ago
Explain →
omnitrack

Unified Experiment Tracking.

★ 0 5y ago
Explain →