gitmyhub

CoLT5-attention

Python ★ 230 updated 1y ago

Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch

No plain-English explanation yet — one is being written right now. Check back in a minute.