hands-on-modern-rl
β
0
updated 15d ago
β fork
π An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.
No plain-English explanation yet β one is being written right now. Check back in a minute.