gitmyhub

local-attention

Python ★ 500 updated 11mo ago

An implementation of local windowed attention for language modeling

No plain-English explanation yet — one is being written right now. Check back in a minute.