gitmyhub

tiny-llm

Python ★ 0 updated 9mo ago ⑂ fork

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

No plain-English explanation yet — one is being written right now. Check back in a minute.