grokfast-pytorch
Python
★ 104
updated 1y ago
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
No plain-English explanation yet — one is being written right now. Check back in a minute.