gitmyhub

custom_flashinfer

Cuda ★ 7 updated 11mo ago ⑂ fork

FlashInfer: Kernel Library for LLM Serving

No plain-English explanation yet — one is being written right now. Check back in a minute.