turbohaul-manager
★ 0
updated 21d ago
⑂ fork
Ollama-shape inference manager using Tom's TurboQuant llama.cpp. FIFO queue + grace + IDLE_HOT hot-hold + model swap on Blackwell.
No plain-English explanation yet — one is being written right now. Check back in a minute.