Flash-RL
Python
★ 304
updated 7mo ago
Implementation for FP8/INT8 Rollout for RL training without performence drop.
No plain-English explanation yet — one is being written right now. Check back in a minute.