Kobe Chen

@kobe0938 ·Santa Clara

I work on Agents & Agents Eval, previously worked on Agents Infra.

44 repos
28 followers
13 following

Python 45%
Jupyter Notebook 18%
Java 9%
Shell 5%
TypeScript 5%

Hi, I'm Kobe 👋 🚀 Currently Maintaining/Contributing Harbor — Agent evaluation framework and RL environment toolkit. [[paper]](https://arxiv.org/abs/2601.11868) SkillsBench — Evaluating how well skills work and how effective agents are at…

All public repos (44)

Show forks Show archived

claude-code-tracing

No description.

Python ★ 3 26d ago
Explain →
llm-inference-fast-benchmark

This repository benchmarks the performance of large language models (LLMs) on a 8B role play model(Sao10K/L3-8B-Lunaris-v1) with an average input of 4k tokens and an output of 250 tokens

Python ★ 3 1y ago
Explain →
LMCache ⑂

Making Long-Context LLM Inference 10x Faster and 10x Cheaper

Python ★ 2 7mo ago
Explain →
cacheserve

No description.

Jupyter Notebook ★ 1 6mo ago
Explain →
docker-hub-star-tracker

No description.

Python ★ 0 27m ago
Explain →
harbor ⑂

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python ★ 0 3h ago
Explain →
harborize-harbor-check-experiment-logs

No description.

Python ★ 0 6d ago
Explain →
habor-Include-Exclude-patterns-in-agent-verifier

No description.

Shell ★ 0 9d ago
Explain →
terminal-bench-leaderboard-detection-blog

No description.

Python ★ 0 11d ago
Explain →
harbor-datasets ⑂

No description.

HTML ★ 0 5mo ago
Explain →
kobe0938

My profile README — agents, LLM infra, and the projects I'm working on.

★ 0 1mo ago
Explain →
long-horizon ⑂

Verifiable long-horizon SWE tasks

★ 0 1mo ago
Explain →
citation-verifier

No description.

TypeScript ★ 0 2mo ago
Explain →
ghostfolio-fork ⑂

Open Source Wealth Management Software. Angular + NestJS + Prisma + Nx + TypeScript 🤍

TypeScript ★ 0 2mo ago
Explain →
smolclaw-fork ⑂

High resolution mock environments for testing and improving claw like agents

Python ★ 0 3mo ago
Explain →
terminal-bench ⑂

A benchmark for LLMs on complicated tasks in the terminal

Python ★ 0 7mo ago
Explain →
LAG

No description.

Python ★ 0 3mo ago
Explain →
terminal-bench-3-fork ⑂

🚧 Accepting Task Submissions 🚧

Python ★ 0 4mo ago
Explain →
awesome-harbor ⑂

A curated list of awesome Harbor ecosystem projects

★ 0 4mo ago
Explain →
skillsbench ⑂

SkillsBench evaluates how well skills work and how effective agents are at using them

PDDL ★ 0 4mo ago
Explain →
blog

No description.

HTML ★ 0 4mo ago
Explain →
tb-parity_experiment-trace

No description.

★ 0 7mo ago
Explain →
terminal-bench-datasets ⑂

No description.

Python ★ 0 7mo ago
Explain →
vllm-fork ⑂

A high-throughput and memory-efficient inference and serving engine for LLMs

Python ★ 0 8mo ago
Explain →
lmcache-trace-analysis

No description.

Python ★ 0 8mo ago
Explain →
lmcache.github.io ⑂

LMCache official blog

HTML ★ 0 8mo ago
Explain →
production-stack ⑂

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python ★ 0 9mo ago
Explain →
VidGen

This research project, developed by Diffusive AI, explores the use of diffusion models and autoregressive models for generating interactive videos and games.

Python ★ 0 1y ago
Explain →
mooncake-trace-replayer

No description.

Python ★ 0 10mo ago
Explain →
docs

No description.

MDX ★ 0 1y ago
Explain →
fastapi

No description.

Python ★ 0 1y ago
Explain →
rembg ⑂

Rembg is a tool to remove images background

Python ★ 0 1y ago
Explain →
gorilla ⑂

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python ★ 0 1y ago
Explain →
ChatGPT-Next-Web-BISV ⑂

A well-designed cross-platform ChatGPT UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT 应用。

★ 0 2y ago
Explain →
chatgpt-vercel ⑂

Elegant and Powerfull. Powered by OpenAI and Vercel.

★ 0 3y ago
Explain →
seismic-hazard-risk-class ⑂

Code supporting Jack Baker's seismic hazard and risk analysis class

★ 0 3y ago
Explain →
Large-Platform-Reinforcement-Learning-Model

No description.

Jupyter Notebook ★ 0 3y ago
Explain →
Full-Stack-Web-Application

No description.

JavaScript ★ 0 3y ago
Explain →
leetcode ⑂

Leetcode solutions

★ 0 3y ago
Explain →
Data-Structure

No description.

Java ★ 0 4y ago
Explain →
Piglets-Nursing-Level-Prediction

No description.

Jupyter Notebook ★ 0 4y ago
Explain →
DataBase-SQL

No description.

Jupyter Notebook ★ 0 4y ago
Explain →
Android-BunnyWorld

No description.

Java ★ 0 4y ago
Explain →
Machine-Learning

No description.

TeX ★ 0 4y ago
Explain →

Kobe Chen

Hi, I'm Kobe 👋

🚀 Currently Maintaining/Contributing

🛠️ Previous Projects

Agents & Evaluation

LLM Inference & Serving Infra

Others

All public repos (44)