dash-infer
C
★ 272
updated 10mo ago
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
No plain-English explanation yet — one is being written right now. Check back in a minute.