H2O

Python | ★ 522 | updated 1y ago

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
