gitmyhub

RL4LMs

Python ★ 2.4k updated 2y ago

A modular RL library to fine-tune language models to human preferences
