gitmyhub

SparseAttention

Python ★ 97 updated 1d ago

PyTorch implementation of the sparse attention mechanism from the paper "Generating Long Sequences with Sparse Transformers".
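For orientation, here is a minimal sketch of the strided sparsity pattern described in that paper, written in plain Python so it runs without dependencies. It is not taken from this repository: the function name `strided_mask` and its parameters are hypothetical. It also merges the paper's two patterns (a local window and a every-`stride`-th-position pattern, which the paper assigns to separate heads or steps) into one boolean mask for brevity.

```python
def strided_mask(n, stride):
    """Build an n x n boolean mask for causal strided sparse attention.

    mask[i][j] is True when query position i may attend to key position j.
    Hypothetical helper for illustration; not this repository's API.
    """
    mask = [[False] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):  # causal: a position never sees the future
            local = (i - j) < stride          # the most recent `stride` positions
            strided = (i - j) % stride == 0   # every `stride`-th earlier position
            mask[i][j] = local or strided
    return mask


if __name__ == "__main__":
    # Print the pattern for a short sequence: '#' = attend, '.' = masked out.
    for row in strided_mask(8, 3):
        print("".join("#" if allowed else "." for allowed in row))
```

In a PyTorch setting, a mask like this would typically be converted to a boolean tensor and applied to the attention scores before the softmax. A dense mask still costs O(n²) memory, so it only illustrates the pattern, not the efficiency gains the paper targets.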
