gitmyhub

nanoChatGPT

Python ★ 0 updated 3y ago ⑂ fork

A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick

No plain-English explanation yet — one is being written right now. Check back in a minute.