gitmyhub

reward-bench

Python ★ 721 updated 4mo ago

RewardBench: the first evaluation tool for reward models.

No plain-English explanation yet — one is being written right now. Check back in a minute.