-
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Python ★ 13k 18d ago
-
gpt-neo ▣
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Python ★ 8.3k 4y ago
-
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Python ★ 7.4k 9d ago
-
pythia
The hub for EleutherAI's work on interpretability and learning dynamics
Jupyter Notebook ★ 2.8k 7mo ago
-
the-pile
No description.
Python ★ 1.7k 3y ago
-
math-lm
No description.
Python ★ 1.1k 2y ago
-
cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Python ★ 845 3mo ago
-
sparsify
Sparsify transformers with SAEs and transcoders
Python ★ 727 5d ago
-
polyglot
Polyglot: Large Language Models of Well-balanced Competence in Multi-languages
★ 487 2y ago
-
DALLE-mtf
OpenAI's DALL-E for large scale training in mesh-tensorflow.
Python ★ 431 4y ago
-
vqgan-clip
No description.
Jupyter Notebook ★ 353 4y ago
-
delphi
Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.
Python ★ 262 5d ago
-
concept-erasure
Erasing concepts from neural representations with provable guarantees
Python ★ 255 1y ago
-
elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
Python ★ 220 5d ago
-
nanoGPT-mup ⑂
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Python ★ 196 5mo ago
-
oslo
OSLO: Open Source for Large-scale Optimization
Python ★ 175 2y ago
-
DeeperSpeed ⑂
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python ★ 173 8mo ago
-
lm_perplexity
No description.
Python ★ 164 5y ago
-
knowledge-neurons
A library for finding knowledge neurons in pretrained transformer models.
Python ★ 160 4y ago
-
pyfra
Python Research Framework
Python ★ 107 3y ago
-
aria
Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)
Python ★ 105 1mo ago
-
github-downloader ⑂
Script for downloading GitHub.
Python ★ 99 1y ago
-
openwebtext2
No description.
Python ★ 95 3y ago
-
dps
Data processing system for polyglot
Python ★ 93 2y ago
-
stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
Python ★ 87 2y ago
-
improved-t5
Experiments for efforts to train a new and improved t5
Python ★ 76 2y ago
-
minetest ⑂
Minetest is an open source voxel game engine with easy modding and game creation
C++ ★ 75 2y ago
-
aria-amt
Efficient and robust implementation of seq-to-seq automatic piano transcription.
Python ★ 70 6mo ago
-
project-menu
See the issue board for the current status of active and prospective projects!
★ 65 4y ago
-
bergson
Mapping out the "memory" of neural nets with data attribution
Python ★ 61 4d ago
-
magiCARP
One stop shop for all things carp
Python ★ 58 3y ago
-
semantic-memorization
No description.
Jupyter Notebook ★ 44 1y ago
-
tqdm-multiprocess
Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly through the main process. It offers similar functionality for python logging.
Python ★ 43 5y ago
-
features-across-time
Understanding how features learned by neural networks evolve throughout training
Python ★ 41 1y ago
-
mp_nerf
Massively-Parallel Natural Extension of Reference Frame
Jupyter Notebook ★ 34 3y ago
-
hae-rae
No description.
★ 33 2y ago
-
rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
Jupyter Notebook ★ 32 2y ago
-
elk-generalization
Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard
Python ★ 31 2y ago
-
steering-llama3
No description.
Python ★ 30 1y ago
-
tokengrams
Efficiently computing & storing token n-grams from large corpora
Rust ★ 27 6d ago
-
clt-training ⑂
Sparsify transformers with cross-layer transcoders
Python ★ 26 7mo ago
-
pile-pubmedcentral
A script for collecting the PubMed Central dataset in a language modelling friendly format.
Python ★ 26 5y ago
-
training-jacobian
No description.
Jupyter Notebook ★ 24 1y ago
-
w2s
No description.
Python ★ 24 1y ago
-
mdl
Minimum Description Length probing for neural network representations
Python ★ 20 1y ago
-
deep-ignorance
No description.
Python ★ 19 5mo ago
-
polyglot-data
Data-related codebase for the polyglot project
Python ★ 19 3y ago
-
BIG-bench ⑂
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Python ★ 19 4y ago
-
best-download
URL downloader supporting checkpointing and continuous checksumming.
Python ★ 19 2y ago
-
pile_dedupe
Pile Deduplication Code
Python ★ 18 3y ago
-
attribute ⑂
No description.
Python ★ 16 7mo ago
-
latent-video-diffusion
Latent video diffusion
Python ★ 16 2y ago
-
NeMo ⑂
NeMo: a toolkit for conversational AI
Python ★ 16 3y ago
-
pile-cc ⑂
No description.
★ 16 4y ago
-
text-generation-testing-ui
Web app for demoing the EAI models
JavaScript ★ 16 4y ago
-
exploring-contrastive-topology
No description.
Jupyter Notebook ★ 15 4y ago
-
polyapprox
Closed-form polynomial approximations to neural networks
Python ★ 13 1y ago
-
pilev2
No description.
Python ★ 13 3y ago
-
pile-literotica
Download, parse, and filter data from Literotica. Data-ready for The-Pile.
Python ★ 12 5y ago
-
datasets ⑂
🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
★ 12 5y ago
-
radioactive-lab
Adapting the "Radioactive Data" paper to work for text models
Python ★ 12 5y ago
-
lm_dataformat ⑂
No description.
Python ★ 11 2y ago
-
transformer-reasoning ⑂
Experiments in transformer knowledge and reasoning
Jupyter Notebook ★ 10 1y ago
-
architecture-objective ⑂
No description.
Python ★ 10 3y ago
-
equivariance
A framework for implementing equivariant DL
Jupyter Notebook ★ 10 5y ago
-
djinn
Generating, validating and running exploitable verifiable coding problems
Python ★ 10 5mo ago
-
hn-scraper
No description.
Python ★ 9 5y ago
-
llemma-sample-explorer
Sample explorer tool for the Llemma models.
HTML ★ 9 2y ago
-
attention-probes
Linear probes with attention weighting
Python ★ 8 10mo ago
-
equinox-llama
Equinox implementation of llama3 and llama3.1
Python ★ 8 1y ago
-
GPTeacher ⑂
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
★ 8 3y ago
-
minetest-baselines
Baseline agents for Minetest tasks.
Python ★ 8 2y ago
-
website
New website for EleutherAI based on Hugo static site generator
HTML ★ 8 3d ago
-
pile-uspto
A script for collecting the USPTO Backgrounds dataset in a language modelling friendly format.
Python ★ 8 5y ago
-
tyche
Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
Jupyter Notebook ★ 8 1y ago
-
tagged-pile
Part-of-Speech Tagging for the Pile and RedPajama
Python ★ 8 3y ago
-
ccs
No description.
Python ★ 7 1y ago
-
multimodal-fid
No description.
Python ★ 7 3y ago
-
aria-utils
MIDI tokenizers and pre-processing utils.
Python ★ 6 5d ago
-
cupbearer ⑂
A library for mechanistic anomaly detection
Jupyter Notebook ★ 6 1y ago
-
weak-to-strong ⑂
No description.
Python ★ 6 2y ago
-
trlx ⑂
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Python ★ 6 2y ago
-
minetest-interpretabilty-notebook
Jupyter notebook for the interpretability section of the minetester blog post
Jupyter Notebook ★ 6 2y ago
-
CodeCARP
Data collection pipeline for CodeCARP. Includes PyCharm plugins.
★ 6 4y ago
-
pile-enron-emails
A script for collecting the Enron Emails dataset in a language modelling friendly format.
Python ★ 6 5y ago
-
LLM-Markov-Chains
Project github for LLM Markov Chains Project
★ 6 3y ago
-
pile-cc-filtering
The code used to filter CC data for The Pile
Python ★ 6 5y ago
-
clearnets
No description.
Python ★ 5 1y ago
-
optax-galore
Adds GaLore style projection wrappers to optax optimizers
Python ★ 5 1y ago
-
architecture-experiments
Repository to host architecture experiments and development using Paxml and Praxis
Python ★ 5 3y ago
-
FLAN ⑂
No description.
Python ★ 5 3y ago
-
thonkenizers
yes
★ 5 4y ago
-
visual-grounding
Visually ground GPT-Neo 1.3b and 2.7b
Python ★ 5 5y ago
-
eleutherai.github.io
This is the Hugo generated website for eleuther.ai. The source of this build is new-website repo.
HTML ★ 5 5y ago
-
pile-explorer
For exploring the data and documenting its limitations
Python ★ 5 5y ago
-
open-r1 ⑂
Fully open reproduction of DeepSeek-R1
Python ★ 5 1y ago
-
scalable-elicitation ⑂
The code used in "Balancing Label Quantity and Quality for Scalable Elicitation"
Jupyter Notebook ★ 4 1y ago
-
monkfish ⑂
No description.
Python ★ 4 1y ago
-
alignment-handbook ⑂
Robust recipes to align language models with human and AI preferences
★ 4 2y ago
-
Unpaired-Image-Generation
Project Repo for Unpaired Image Generation project
★ 4 2y ago
-
lm-scope
No description.
Jupyter Notebook ★ 4 4y ago
-
megatron-3d ▣
No description.
Python ★ 4 5y ago
-
pile-website ⑂
No description.
HTML ★ 4 2y ago
-
pile-ubuntu-irc
A script for collecting the Ubuntu IRC dataset in a language modelling friendly format.
Python ★ 4 5y ago
-
emergent-misalignment ⑂
No description.
Python ★ 4 3d ago
-
ngrams-across-time
No description.
Jupyter Notebook ★ 4 1y ago
-
rtopk
https://github.com/xiexi51/RTopK PyTorch wrapper
Cuda ★ 3 1y ago
-
sae_overlap
Accompanying code for our research on SAE feature overlap when trained on different seeds.
Jupyter Notebook ★ 3 1y ago
-
variance-across-time
Studying the variance in neural net predictions across training time
Python ★ 3 2y ago
-
EvilModel
A replication of "EvilModel 2.0: Bringing Neural Network Models into Malware Attacks"
★ 3 2y ago
-
eai-prompt-gallery
Library of interesting prompt generations
JavaScript ★ 3 4y ago
-
isaac-mchorse
EleutherAI's discord bot
Python ★ 3 5y ago
-
pile-allpoetry
Scraper to gather poems from allpoetry.com
Python ★ 3 5y ago
-
bucket-cleaner
A small utility to clear out old model checkpoints in Google Cloud Buckets whilst keeping tensorboard event files
Python ★ 3 5y ago
-
unlearn
No description.
Python ★ 3 11d ago
-
reddit-comment-processing
No description.
Python ★ 3 5y ago
-
composer ⑂
Train neural networks up to 7x faster
Python ★ 3 3y ago
-
gamescope
Can interpretability methods confer an advantage in competitive games?
Python ★ 2 6mo ago
-
fmri
Analogue of fMRI on artificial neural networks
★ 2 1y ago
-
pd-books
No description.
Jupyter Notebook ★ 2 2y ago
-
tuned-lens ⑂
Tools for understanding how transformer predictions are built layer-by-layer
Python ★ 2 7mo ago
-
tinydpo ⑂
No description.
★ 2 2y ago
-
eleutherai-instruct-dataset
A large instruct dataset for open-source models (WIP).
★ 2 3y ago
-
examples ⑂
Mosaicml example benchmarks + LLM scripts
Python ★ 2 3y ago
-
minetest_game ⑂
Minetest Game - The default game for the Minetest engine [https://github.com/minetest/minetest/]
★ 2 3y ago
-
groupoid-rl
No description.
Jupyter Notebook ★ 2 4y ago
-
SkipTranscoderSAEBench
No description.
Python ★ 2 11mo ago
-
POSER ⑂
Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals
Python ★ 2 1y ago
-
auto-circuit ⑂
A library for efficient patching and automatic circuit discovery.
★ 2 1y ago
-
huggingface.js ⑂
Utilities to use the Hugging Face Hub API
TypeScript ★ 2 1y ago
-
truffaldino
Investigating goal instability in RL
Python ★ 1 1y ago
-
rllm ⑂
Democratizing Reinforcement Learning for LLMs
Jupyter Notebook ★ 1 1y ago
-
bayesian-adam
Exactly what it says on the tin
Python ★ 1 2y ago
-
RWKV-LM ⑂
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Python ★ 1 2y ago
-
conceptual-constraints
Applying LEACE to models during training
Jupyter Notebook ★ 1 2y ago
-
aria.cpp
GGML implementation of https://github.com/EleutherAI/aria
CMake ★ 1 2y ago
-
classifier-latent-diffusion
No description.
Python ★ 1 2y ago
-
language-adaptation
No description.
★ 1 2y ago
-
maxtext ⑂
A simple, performant and scalable Jax LLM!
★ 1 3y ago
-
irrlicht ⑂
Minetest's fork of Irrlicht
C++ ★ 1 2y ago
-
lm-evaulation-ui
App for generating html table from LM evaluation JSONs
JavaScript ★ 1 4y ago
-
poll_website_demo
Flask Based Polling Website Demo
Python ★ 1 5y ago
-
discord-role-bot ⑂
Control Discord Roles with Reactions
★ 1 5y ago
-
eleuther-blog
Here is the generated content for the EleutherAI blog. The source is in the new-website repo.
HTML ★ 1 5y ago
-
lang-filter
Filter text files or archives by language
Python ★ 1 5y ago
-
pile-cord19
A script for collecting the CORD-19 dataset in a language modelling friendly format.
Python ★ 1 5y ago
-
jusText ⑂
Heuristic based boilerplate removal tool
Python ★ 1 5y ago
-
grouch
No description.
HTML ★ 1 5y ago
-
circuit-breakers-SFT ⑂
Improving Alignment and Robustness with Circuit Breakers
★ 1 1y ago
-
SAELens ⑂
Training Sparse Autoencoders on Language Models
★ 1 2y ago
-
prefix-free-tokenizer
A prefix free tokenizer
Python ★ 1 2y ago
-
common-llm-settings
Common LLM Settings App
JavaScript ★ 1 2y ago
-
alignment-reader
Search and filter through alignment literature
JavaScript ★ 1 4y ago
-
fractal-ml ⑂
Fun stuff with fractal machine learning
Jupyter Notebook ★ 1 5y ago
-
cc_img_dl ⑂
No description.
Python ★ 1 4y ago
-
gradient-routing ⑂
No description.
Python ★ 0 3mo ago
-
rh-indicators
No description.
Python ★ 0 2mo ago
-
hackable-bergson ⑂
Simplified library for mapping out the "memory" of neural nets with data attribution
★ 0 7mo ago
-
vllm ⑂
A high-throughput and memory-efficient inference and serving engine for LLMs
★ 0 7mo ago
-
verifiers ⑂
Verifiers for LLM Reinforcement Learning
Python ★ 0 10mo ago
-
wmdp ⑂
WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.
Jupyter Notebook ★ 0 1y ago
-
Megatron-LM ⑂
Ongoing research training transformer models at scale
★ 0 1y ago
-
mixture-of-depths
No description.
★ 0 1y ago
-
llm-score-behavior
No description.
Python ★ 0 1y ago
-
TransformerEngine ⑂
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Python ★ 0 1y ago
-
Plenoxels_FreeNerf ⑂
Implementation of Plenoxels radiance fields without neural networks, with free nerf strategy
★ 0 3y ago
-
oslo-1 ⑂
OSLO: Open Source for Large-scale Optimization
★ 0 3y ago
-
t-zero ⑂
Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
★ 0 3y ago
-
CommonLoopUtils ⑂
[WIP] a version of CLU with WandB logging added.
Jupyter Notebook ★ 0 3y ago
-
pytorch-fid ⑂
Compute FID scores with PyTorch.
★ 0 4y ago
-
visual-grounding-jax
Experiments pertaining to visually grounding Neo, built off of mesh-transformer-jax
★ 0 5y ago
-
OpenInstructData
No description.
★ 0 5y ago
-
depoison
Fixes poisoned directories in google cloud buckets
Python ★ 0 5y ago
-
Garner-python ⑂
A library containing all you need to easily integrate with the Garner data crowdsourcing system
★ 0 5y ago
-
pile-arxiv
No description.
Python ★ 0 5y ago
-
lingvo ⑂
Lingvo
★ 0 5y ago
-
djinn-problems
Problems generated by djinn (exploitably verifiable coding problems)
★ 0 5mo ago
-
truncated-gaussian ▣
Method-of-moments estimation and sampling for truncated multivariate Gaussian distributions
Python ★ 0 2y ago
-
CAA ⑂
Steering Llama 2 with Contrastive Activation Addition
★ 0 2y ago
-
mup ⑂
maximal update parametrization (µP)
★ 0 2y ago
-
gaia ⑂
Hugging Face and Pyserini interoperability
★ 0 3y ago
-
omnitrack
Unified Experiment Tracking.
★ 0 5y ago