Fast-dLLM
Python
★ 1.0k
updated 22d ago
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
No plain-English explanation yet — one is being written right now. Check back in a minute.