CarperAI ORG

@CarperAI · carper.ai

FOSS RLHF

27 repos
542 followers
0 following

Python 73%
Jupyter Notebook 27%

All public repos (27)

trlx ★ PINNED

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python ★ 4.7k 2y ago
cheese ★ PINNED

Used for adaptive human-in-the-loop evaluation of language and embedding models.

Python ★ 306 3y ago
OpenELM ★ PINNED

Evolution Through Large Models

Python ★ 741 2y ago
DRLX ★ PINNED

Diffusion Reinforcement Learning Library

Python ★ 195 2y ago
Code-Pile

This repository contains all the code for collecting large amounts of code from GitHub.

Python ★ 110 3y ago
autocrit

A repository for transformer critique learning and generation.

Python ★ 89 2y ago
InstructGPT

For experiments involving InstructGPT. Currently used for documenting open research questions.

★ 71 3y ago
squeakily

A library for squeakily cleaning and filtering language datasets.

Jupyter Notebook ★ 50 3y ago
Algorithm-Distillation-RLHF

No description.

Python ★ 35 3y ago
decontamination

This repository contains code for removing benchmark data from your training data to help combat data snooping.

Jupyter Notebook ★ 28 3y ago
treasure_trove

No description.

Jupyter Notebook ★ 22 2y ago
nmmo-environment ⑂

Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research

Python ★ 15 2y ago
CodeReviewSE

Code related to scraping the Code Review StackExchange.

Python ★ 12 3y ago
pilev2 ⑂

No description.

Python ★ 11 3y ago
magicarp-v2

magiCARP is an API used for cross-encoder training.

Python ★ 9 2y ago
nmmo-baselines ⑂

Baselines for Neural MMO. New users should treat this repo as a starter project.

Python ★ 7 1y ago
ArchitextRL

No description.

Python ★ 7 3y ago
Polygraph

RLHF Mechanistic Interpretability and Deception

★ 6 2y ago
data-preparation ⑂

Code used for sourcing and cleaning the BigScience ROOTS corpus.

E ★ 4 3y ago
FastChat ⑂

An open platform for training, serving, and evaluating chatbots based on large language models.

★ 4 3y ago
AutoPaperclipMaximizer

👀

★ 3 3y ago
sft

No description.

Python ★ 2 3y ago
contriever ⑂

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python ★ 2 3y ago
maxtext ⑂

A simple, performant and scalable Jax LLM!

Python ★ 1 3y ago
diversity_metrics

No description.

Jupyter Notebook ★ 1 2y ago
goosebox

Sandboxed evaluation server for running code snippets.

★ 1 3y ago
tinypar ⑂

No description.

Python ★ 0 2y ago