self-rewarding-lm-pytorch
Python
★ 1.4k
updated 2y ago
Implementation of the training framework proposed in Self-Rewarding Language Models, from Meta AI