gitmyhub

SRPO

Python ★ 1.3k updated 1mo ago

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

No plain-English explanation yet — one is being written right now. Check back in a minute.