
OPD

Python ★ 689 updated 21d ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
