AQLM
★ 0
updated 1y ago
⑂ fork
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression https://arxiv.org/abs/2405.14852
No plain-English explanation yet — one is being written right now. Check back in a minute.