
smoothquant

Python ★ 1.7k updated 1y ago

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
