gitmyhub

ShallowFF

Python ★ 13 updated 22h ago

Zeta implemantion of "Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers"

No plain-English explanation yet — one is being written right now. Check back in a minute.