gitmyhub

LLMPrune-BESA

Python ★ 17 updated 2y ago

BESA is a differentiable weight pruning technique for large language models.

No plain-English explanation yet — one is being written right now. Check back in a minute.