gitmyhub

llm-compressor

★ 0 updated 6mo ago ⑂ fork

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

No plain-English explanation yet — one is being written right now. Check back in a minute.