WeDLM
Python
★ 644
updated 3mo ago
WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups over vLLM-optimized baselines.
No plain-English explanation yet — one is being written right now. Check back in a minute.