gitmyhub

xllm

C++ ★ 1.3k updated 2d ago

A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.

No plain-English explanation yet — one is being written right now. Check back in a minute.