gitmyhub

Flash-RL

Python ★ 304 updated 7mo ago

Implementation for FP8/INT8 Rollout for RL training without performence drop.

No plain-English explanation yet — one is being written right now. Check back in a minute.