Hi, I'm Samzong (船长) 👋
What I build
- Inference Serving Stack: Scheduling and routing across heterogeneous models — the production path for vLLM and llm-d workloads on Kubernetes.
- Kubernetes-Native Workload Plumbing: Batch queueing, multi-cluster scheduling, and GPU sharing for AI/ML workloads — contributing upstream to Kueue, Karmada, and HAMi.
- Agent Harnesses: The supervision layer for long-running LLM agents — parallel sessions, multi-agent teams, artifacts, approval gates, scheduled dispatch. Because the work shouldn't collapse into chat.
- Agentic Developer Workflows: Multi-worktree dispatch, Claude Code skills, and commit/PR/task automation — the inner loop I live in daily.
Contributions
- OpenClaw (Core Contributor, Current Focus): Open-source agent infrastructure for long-running, multi-channel AI work. 
- semantic-router (Committer): Defining the decision-making layer for multi-model LLM serving. 
- llm-d: Cloud-native infrastructure for disaggregated LLM inference. 
- HAMi & Kueue: Kubernetes-native batch scheduling and GPU virtualization. 
- Istio: Traffic governance for the service mesh layer. 
- Karmada & Kubernetes: Multi-cluster orchestration foundations.  
Side Projects
I build tools to fix my own problems.
- ClawWork: A desktop workspace for OpenClaw — parallel task sessions, multi-agent Teams, TeamsHub marketplace, artifacts, and scheduled automation.  
- lathe: Agent-friendly CLI generator for APIs: turn Swagger, OpenAPI, and google.api.http protos into single-binary Cobra CLIs.  
- Recall: Local-first TUI for searching Claude Code, Codex, and OpenCode history.  
- mailbell: Minimal macOS menu bar notifier for Gmail.  
- gmc: Parallel git worktrees for parallel AI agents, plus AI-generated commits.  
- codex-agents-local: Local `AGENTS.local.md` overlays for Codex via hooks, without replacing the official `codex` command.
- merge-scout: A Claude Code skill for vibe coding: ranks GitHub issues by contributability × merge probability so your agent picks work that will actually land.
- mote: Rewrite any selected text on macOS via Markdown-defined commands. 
- Chrome TabBoost: Browser tab overload is a bug. I patched it with an extension.  
- MacMusicPlayer: A minimalist, clean music player for macOS.  
- ConfigForge: Manage `~/.ssh/config` and `kubeconfig` without the headache.
- LogoWallpaper: Generating brand assets shouldn't take 30 minutes.
- SaveEye: A minimalist eye care reminder that doesn't annoy you.  
- Branchlight: Checking PRs in browser tabs is a ritual. Moved it to the menubar.  
- mdctl: AI-powered Markdown workflow automation.  
- hf-model-downloader: Painless Hugging Face model downloads.  
- mirrormate: Docker pulls failing? I fixed it with magic.  
- swagger-online: Unified Swagger UI. No more tab chaos.  
- ai-icon-generator: I needed icons, so I built a generator.  
- gofs: `python -m http.server` is slow. Rewrote it in Go.
- gh-x: Batch repo operations as a `gh` CLI extension. One-by-one doesn't scale.
- convostore: Stateful LLM APIs shouldn't be this hard. A Redis-backed conversation store for OpenAI Responses API / vLLM.
- prompts: Prompt engineering is the real frontend of LLMs. I track mine here.  
- openclaw-gateway-tunnel *(RFC, design phase)*: Zero-binary ngrok tunnel for OpenClaw gateways. Looking for co-designers before code lands. 
- moltbot-channel-feishu *(legacy, superseded by OpenClaw official)*: Feishu/Lark channel plugin for Moltbot/Clawdbot.  

- ClawWork ★ PINNED ⑂: Client for OpenClaw. Connect ClawWork to your own OpenClaw and unlock 10x multi-session productivity. (★ 0, 2mo ago)
- gmc ★ PINNED: Parallel git worktrees for parallel AI agents, plus AI-generated commits. (Go, ★ 14, 4d ago)
- semantic-router ★ PINNED ⑂: Intelligent Mixture-of-Models Router for Efficient LLM Inference. (Go, ★ 0, 13d ago)
- kueue ★ PINNED ⑂: Kubernetes-native Job Queueing. (Go, ★ 0, 1mo ago)
- MacMusicPlayer ★ PINNED: A clean, lightweight music player for macOS. (Swift, ★ 83, 3d ago)
- chrome-tabboost: TabBoost is a Google Chrome extension that replicates commonly used features from Arc browser to enhance the Chrome user experience. (JavaScript, ★ 80, 7mo ago)
- ConfigForge: An open-source SSH and Kubernetes configuration management tool designed for macOS users. (Swift, ★ 44, 9mo ago)
- prompts: Only manage prompts; the future of LLMs is prompt engineering. (TypeScript, ★ 35, 1y ago)
- hf-model-downloader: A cross-platform GUI application for easily downloading Hugging Face models without requiring technical knowledge or setup. (Python, ★ 29, 6mo ago)
- mdctl: An AI-powered CLI tool to enhance your Markdown workflow, with auto-image downloading, translation, and more features coming soon! (Go, ★ 26, 7mo ago)
- ai-icon-generator: An open-source icon generation tool based on OpenAI gpt-image-1. (TypeScript, ★ 18, 7mo ago)
- Recall: Local-first search, usage, export, and resume across AI coding sessions from Claude Code, Codex, OpenCode, Cursor, Gemini, Cline, Pi, Kiro, Copilot CLI, and Antigravity. Hybrid FTS + embeddings, JSONL export, per-source usage dashboard, all on-device. (Rust, ★ 16, 1d ago)
- samzong: No description. (HTML, ★ 11, 10d ago)
- n8n-trans: If you want n8n in Chinese (in fact, it works for any language). (JavaScript, ★ 10, 9mo ago)
- gofs: A lightweight, fast HTTP file server written in Go. (Go, ★ 9, 2mo ago)
- moltbot-channel-feishu: Legacy Feishu/Lark channel plugin for Moltbot/Clawdbot. OpenClaw now has an official Feishu channel. (TypeScript, ★ 7, 3mo ago)
- LogoWallpaper: A macOS app for quickly generating clean brand wallpapers with multi-display support. (Swift, ★ 6, 7mo ago)
- agent-brains: Personal agent brain catalog for coding agents: AGENTS.md, SOUL.md, MEMORY.md, skills, workflows, and a bundled loader. (Python, ★ 4, 3d ago)
- mote: Menu bar macOS app for rewriting selected text with OpenAI-compatible models. (Swift, ★ 4, 17d ago)
- brew-updater: No description. (Go, ★ 4, 4mo ago)
- homebrew-tap: This is a custom Homebrew tap for my personal applications and tools. (Ruby, ★ 3, 4h ago)
- codex-agents-local: Local AGENTS.local.md overlays for Codex via hooks, without replacing the official codex command. (Python, ★ 3, 19d ago)
- SaveEye: A minimalist macOS eye care reminder app. (Swift, ★ 3, 9d ago)
- branchlight: A quiet macOS menubar hub for GitHub work. (Swift, ★ 3, 21d ago)
- DaoCloud-docs ⑂: DaoCloud Enterprise Open Documents. (Python, ★ 3, 1mo ago)
- yt-search-api: A high-performance API service for yt-dlp proxying online search services (such as YouTube). (TypeScript, ★ 3, 1y ago)
- sd-chat: An AI creation platform based on Stable Diffusion, supporting both image and video generation. (Python, ★ 3, 1y ago)
- merge-scout: No description. (TypeScript, ★ 2, 1mo ago)
- swagger-online: Swagger Online is a lightweight React tool that aggregates multiple Swagger/OpenAPI specs into one unified, searchable, and comparable interface. (JavaScript, ★ 2, 5mo ago)
- slidev-theme-daocloud: No description. (Vue, ★ 2, 5mo ago)
- modelfs: No description. (Go, ★ 2, 6mo ago)
- taobao-openapi: fit python3 (Python, ★ 2, 4y ago)
- mailbell: Minimal macOS menu bar notifier for Gmail. (Swift, ★ 1, 8d ago)
- karmada-website ⑂: Karmada website and documentation repo. (Shell, ★ 1, 8mo ago)
- Awesome-LLMOps ⑂: 🎉 An awesome & curated list of best LLMOps tools. (Python, ★ 1, 1y ago)
- mkdocs-with-pdf-support-material-v8 ⑂: Generate a single PDF file from MkDocs repository. (HTML, ★ 1, 3y ago)
- dify-prototype: No description. (TypeScript, ★ 0, 1d ago)
- prooflet: The proof layer for prototypes. (TypeScript, ★ 0, 2d ago)
- openclaw ⑂: Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞 (TypeScript, ★ 0, 10d ago)
- mm: Multi-language maintenance CLI for documentation workflows. (Go, ★ 0, 18d ago)
- openab ⑂: A lightweight, secure, cloud-native ACP harness that bridges Discord and any ACP-compatible coding CLI. (★ 0, 22d ago)
- k-gateway-api ⑂: Repository for the next iteration of composite service (e.g. Ingress) and load balancing APIs. (Go, ★ 0, 23d ago)
- clawsweeper ⑂: ClawSweeper scans all issues and PRs and suggests what we can close, and why. It runs every PR / Issue once a week. (JavaScript, ★ 0, 26d ago)
- open-design ⑂: 🎨 Local-first, open-source alternative to Anthropic's Claude Design. ⚡ 19 Skills · ✨ 71 brand-grade Design Systems · 🖼️ sandboxed preview · 📦 HTML/PDF/PPTX export. 🤖 Runs on Claude Code / Codex / Cursor / Gemini CLI / OpenCode / Qwen. (TypeScript, ★ 0, 18d ago)
- matrixhub ⑂: No description. (Go, ★ 0, 18d ago)
- forge-prototype: No description. (TypeScript, ★ 0, 1mo ago)
- sgl-ome ⑂: OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs). (Go, ★ 0, 6mo ago)
- openclaw-gateway-tunnel: A zero-dependency openclaw plugin that exposes the local gateway to the public internet via ngrok's embedded Node SDK. Design phase, seeking feedback. (★ 0, 2mo ago)
- awesome-openclaw ⑂: A curated list of OpenClaw resources, tools, skills, tutorials & articles. OpenClaw (formerly Moltbot / Clawdbot) is an open-source self-hosted AI agent for WhatsApp, Telegram, Discord & 50+ integrations. (★ 0, 2mo ago)
- hiclaw ⑂: Open-source Agent Teams system with IM-based multi-Agent collaboration and human-in-the-loop oversight. (★ 0, 3mo ago)
- opencode ⑂: The open source coding agent. (TypeScript, ★ 0, 3mo ago)
- dify ⑂: Production-ready platform for agentic workflow development. (★ 0, 2mo ago)
- hwameistor-operator ⑂: Operator that manages HwameiStor. (★ 0, 3mo ago)
- hwameistor ⑂: Hwameistor is an HA local storage system for cloud-native stateful workloads. (★ 0, 3mo ago)
- gh-x: GitHub CLI extension for batch repo operations. (Go, ★ 0, 3mo ago)
- drun-docs ⑂: D.run documentation site. (HTML, ★ 0, 3mo ago)
- spiderpool ⑂: Underlay and RDMA network solution for Kubernetes, for bare metal, VM and any public cloud. (★ 0, 4mo ago)
- samzong.github.io: Work, and reflections on life. Explore cloud native, share experiences, contemplate life. (TypeScript, ★ 0, 5mo ago)
- plano ⑂: Ship agents faster. Plano is delivery infrastructure for agentic applications. A models-native proxy server & dataplane that offloads the plumbing work, so you stay focused on product logic. (★ 0, 5mo ago)
- routeworks.github.io: Website for RouterArena. (TypeScript, ★ 0, 5mo ago)
- RouterArena ⑂: RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard. (★ 0, 5mo ago)
- mirrormate: One-shot mirror injection for Docker builds and compose workflows. (Go, ★ 0, 5mo ago)
- dataset ⑂: Simplified Data Management and Sharing for Kubernetes. (★ 0, 5mo ago)
- prom-etl-db: A Go-based ETL tool that collects Prometheus metrics and stores them in a MySQL database with scheduled execution. (Go, ★ 0, 6mo ago)
- litellm ⑂: Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, VLLM, NVIDIA NIM] (★ 0, 6mo ago)
- llm-d ⑂: llm-d is a Kubernetes-native high-performance distributed LLM inference framework. (Shell, ★ 0, 6mo ago)
- llm-d-modelservice ⑂: Helm charts for deploying models with llm-d. (Smarty, ★ 0, 6mo ago)
- remove-image-navbar: No description. (Python, ★ 0, 1y ago)
- llm-d-kv-cache-manager ⑂: Distributed KV cache coordinator. (★ 0, 7mo ago)
- vllm ⑂: A high-throughput and memory-efficient inference and serving engine for LLMs. (Python, ★ 0, 7mo ago)
- helm-charts: Kubernetes Helm Charts for samzong's projects. (★ 0, 7mo ago)
- convostore: Lightweight Redis-backed conversation state service that normalizes prompts, trims history, and exposes APC-friendly context for OpenAI Responses API, vLLM, and custom inference gateways. (Go, ★ 0, 8mo ago)
- cli-template: A production-ready template for creating Go CLI applications with Cobra/Viper, automated CI/CD, Docker support, and Homebrew publishing. (Shell, ★ 0, 7mo ago)
- karmada-dashboard ⑂: Web UI for Karmada. (Go, ★ 0, 7mo ago)
- rbg ⑂: A workload for deploying LLM inference services on Kubernetes. (★ 0, 7mo ago)
- training-sample-code: Learn and try with TensorFlow and PyTorch. (Python, ★ 0, 7mo ago)
- production-stack ⑂: vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization. (Python, ★ 0, 8mo ago)
- grafana ⑂: The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more. (★ 0, 8mo ago)
- karmada ⑂: Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration. (★ 0, 8mo ago)
- llm-d.github.io ⑂: Website for llm-d: This repository builds the website seen at llm-d.ai. (JavaScript, ★ 0, 9mo ago)
- action-qiniu-upload-nodejs ⑂: GitHub Action for Uploading Files to Qiniu.com. (TypeScript, ★ 0, 9mo ago)
- public-image-mirror ⑂: Many images are hosted overseas, such as gcr. Downloads from within China are slow and need acceleration. (Shell, ★ 0, 9mo ago)
- iframe-dashboard: No description. (Go, ★ 0, 10mo ago)
- kubernetes-website ⑂: Kubernetes website and documentation repo. (HTML, ★ 0, 11mo ago)
- kgateway ⑂: The Cloud-Native API Gateway and AI Gateway. (★ 0, 11mo ago)
- envoy ⑂: Cloud-native high-performance edge/middle/service proxy. (★ 0, 11mo ago)
- hami-website ⑂: No description. (Shell, ★ 0, 11mo ago)
- LMCache ⑂: Redis for LLMs. (★ 0, 11mo ago)
- HAMi ⑂: Heterogeneous AI Computing Virtualization Middleware (Project under CNCF). (★ 0, 11mo ago)
- aibrix ⑂: Cost-efficient and pluggable Infrastructure components for GenAI inference. (★ 0, 11mo ago)
- llm-d-deployer ⑂: Helm charts for llm-d. (★ 0, 11mo ago)
- sglang ⑂: SGLang is a fast serving framework for large language models and vision language models. (Python, ★ 0, 1y ago)
- ShiArthur03-backup ⑂: No description. (★ 0, 1y ago)