nemesis

Python ★ 63 updated 3y ago

Reward Model framework for LLM RLHF

No plain-English explanation yet — one is being written right now. Check back in a minute.