native-sparse-attention-triton
★ 1
updated 11mo ago
⑂ fork
Efficient Triton implementation of Native Sparse Attention.