smoothquant

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
