xllm
C++
★ 1.3k
updated 2d ago
A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.
No plain-English explanation yet — one is being written right now. Check back in a minute.