tair-kvcache
C++
★ 197
updated 20h ago
Alibaba Cloud's high-performance KVCache system for LLM inference, with components for global cache management, inference simulation(HiSim), and more.
No plain-English explanation yet — one is being written right now. Check back in a minute.