LLMPrune-BESA
Python
★ 17
updated 2y ago
BESA is a differentiable weight pruning technique for large language models.