OpenStrawberry
Python
★ 31
updated 1d ago
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
No plain-English explanation yet — one is being written right now. Check back in a minute.