cold-compress
Python
★ 151
updated 1y ago
Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of GPT-Fast, a simple, PyTorch-native generation codebase.
No plain-English explanation yet — one is being written right now. Check back in a minute.