Fast-dLLM

Python ★ 1.0k updated 22d ago

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
