gitmyhub

diffq

Python ★ 239 updated 3y ago ▣ archived

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.

No plain-English explanation yet — one is being written right now. Check back in a minute.