gitmyhub

OpenStrawberry

Python ★ 31 updated 1d ago

An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO

No plain-English explanation yet — one is being written right now. Check back in a minute.