Note: we’ve temporarily stopped development on the open-source version of OpenPipe to integrate some proprietary third-party code. We hope to make the non-proprietary parts of the repository open again under…
Note: we’ve temporarily stopped development on the open-source version of OpenPipe to integrate some proprietary third-party code. We hope to make the non-proprietary parts of the repository open again under an open core model once we have the bandwidth to do so!
OpenPipe
Open-source fine-tuning and model-hosting platform.
Demo - Running Locally - Docs
Use powerful but expensive LLMs to fine-tune smaller and cheaper models suited to your exact needs. Query your past requests and evaluate models against one another. Switch between OpenAI and fine-tuned models with one line of code.
Features
- Easy integration with OpenAI's SDK in both Python and TypeScript.
- OpenAI-compatible chat completions endpoint.
- Fine-tune GPT 3.5, Mistral, and Llama 2 models. Host on-platform or download the weights.
- Query logs using powerful built-in filters.
- Import datasets in OpenAI-compatible JSONL files.
- Prune large chunks of duplicate text like system prompts.
- Compare output accuracy against base models like gpt-3.5-turbo.
Supported Base Models
- mistralai/Mixtral-8x7B-Instruct-v0.1
- OpenPipe/mistral-ft-optimized-1227
- meta-llama/Llama-3-8B
- meta-llama/Llama-3-70B
- gpt-3.5-turbo-0613
- gpt-3.5-turbo-1106
- gpt-3.5-turbo-0125
Documentation
- See docs
Running Locally
1. Install Postgresql.
2. Install NodeJS 20 (earlier versions will very likely work but aren't tested).
3. Install pnpm: npm i -g pnpm
4. Clone this repository: git clone https://github.com/openpipe/openpipe
5. Install the dependencies: cd openpipe && pnpm install
6. Create a .env file (cd app && cp .env.example .env) and enter your OPENAI_API_KEY.
7. If you just installed postgres and wish to use the default DATABASE_URL run the following commands:
sh
psql postgres
CREATE ROLE postgres WITH LOGIN PASSWORD 'postgres';
ALTER ROLE postgres SUPERUSER;
8. Update DATABASE_URL if necessary to point to your Postgres instance and run pnpm prisma migrate dev in the app directory to create the database.
9. Create a GitHub OAuth App, set the callback URL to /api/auth/callback/github, e.g. http://localhost:3000/api/auth/callback/github.
10. Update the GITHUB_CLIENT_ID and GITHUB_CLIENT_SECRET values from the Github OAuth app (Note: a PR to make auth optional when running locally would be a great contribution!).
11. To start the app run pnpm dev in the app directory.
12. Navigate to http://localhost:3000
Using Locally
sh
import os
from openpipe import OpenAI
client = OpenAI(
api_key="Your API Key",
openpipe={
"api_key": "Your OpenPipe API Key",
"base_url": "http://localhost:3000/api/v1", # Local OpenPipe instance
}
)
completion = client.chat.completions.create(
model="gpt-3.5-turbo",
messages=[{"role": "system", "content": "count to 10"}],
openpipe={
"tags": {"prompt_id": "counting"},
"log_request": True
},
)
Testing Locally
1. Copy your .env file to .env.test.
2. Update the DATABASE_URL to have a different database name than your development one
3. Run DATABASE_URL=[your new datatase url] pnpm prisma migrate dev --skip-seed --skip-generate
4. Run pnpm test
-
ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!
Python ★ 10k 5h agoExplain → -
OpenPipe
Turn expensive prompts into cheap fine-tuned models
TypeScript ★ 2.8k 2y agoExplain → -
deductive-reasoning
Train your own SOTA deductive reasoning model
Python ★ 112 1y agoExplain → -
pii-redaction
Detect and redact PII locally with SOTA performance
Python ★ 109 1y agoExplain → -
open_deep_research_training
Training setup for Langchain's Open Deep Research
Python ★ 78 9mo agoExplain → -
Summary-RL
Train an agent to generate high quality summaries
Jupyter Notebook ★ 44 4mo agoExplain → -
rl-experiments
OpenPipe Reinforcement Learning Experiments
Jupyter Notebook ★ 34 1y agoExplain → -
email-deep-research
No description.
Python ★ 22 1y agoExplain → -
best-hn
No description.
Jupyter Notebook ★ 10 1y agoExplain → -
art-notebooks
Notebooks to demonstrate ART (Agent Reinforcement Trainer) in practice!
Shell ★ 8 19d agoExplain → -
art-langgraph
No description.
Python ★ 5 11mo agoExplain → -
step-one ⑂
This repo is only used for searching reddit
Python ★ 3 2y agoExplain → -
trpc-openapi ⑂
OpenAPI support for tRPC 🧩 - with streaming :)
TypeScript ★ 2 2y agoExplain → -
S3LoRAResolver
No description.
Python ★ 1 1y agoExplain → -
verl ⑂
verl: Volcano Engine Reinforcement Learning for LLMs
Python ★ 1 10mo agoExplain → -
art-star-count
Display ART repository star count on a tablet
HTML ★ 1 11mo agoExplain → -
vllm-lora ⑂
A high-throughput and memory-efficient inference and serving engine for LLMs
Python ★ 1 1y agoExplain → -
tsoa ⑂
Build OpenAPI-compliant REST APIs using TypeScript and Node
TypeScript ★ 1 2y agoExplain → -
skypilot-modal ⑂
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).
Python ★ 0 4d agoExplain → -
vllm-completions ⑂
A high-throughput and memory-efficient inference and serving engine for LLMs
Python ★ 0 11mo agoExplain → -
ArcticInference ⑂
No description.
Python ★ 0 1y agoExplain → -
skypilot-catalog ⑂
No description.
★ 0 1y agoExplain → -
sglang ⑂
SGLang is a fast serving framework for large language models and vision language models.
Python ★ 0 1y agoExplain → -
trl ⑂
Train transformer language models with reinforcement learning.
★ 0 1y agoExplain → -
axolotl ⑂
Go ahead and axolotl questions
Python ★ 0 2y agoExplain → -
vllm ⑂
A high-throughput and memory-efficient inference and serving engine for LLMs
Python ★ 0 2y agoExplain → -
alpaca_eval ⑂
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Jupyter Notebook ★ 0 2y agoExplain → -
mistral-client-js ⑂
JS Client library for Mistral AI platform
★ 0 2y agoExplain → -
openapi-typescript-codegen ⑂
NodeJS library that generates Typescript or Javascript clients based on the OpenAPI specification
★ 0 1y agoExplain →
No repos match these filters.