gitmyhub

vime

Python ★ 320 updated 1d ago

An LLM post-training framework with vLLM for RL Scaling

No plain-English explanation yet — one is being written right now. Check back in a minute.