memory-compressed-attention
Python
★ 71
updated 3y ago
Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"
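A minimal sketch of the idea, not this repository's code: memory-compressed attention shortens the key and value sequences before ordinary scaled dot-product attention, so each query attends over fewer positions. The paper compresses with a learned strided convolution. The sketch below substitutes fixed mean pooling to stay dependency-free, and it omits multiple heads and causal masking. The function names and the `factor` parameter are assumptions made for illustration.

```python
import math


def compress(seq, factor):
    """Mean-pool each run of `factor` consecutive vectors into one.

    Stand-in (assumption) for the learned strided convolution; a ragged
    tail shorter than `factor` is dropped.
    """
    dim = len(seq[0])
    pooled = []
    for start in range(0, len(seq) - factor + 1, factor):
        block = seq[start:start + factor]
        pooled.append([sum(vec[d] for vec in block) / factor for d in range(dim)])
    return pooled


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def memory_compressed_attention(queries, keys, values, factor=3):
    """Single-head attention over compressed keys and values (no masking)."""
    keys_c = compress(keys, factor)
    values_c = compress(values, factor)
    scale = 1.0 / math.sqrt(len(queries[0]))
    dim_v = len(values_c[0])
    outputs = []
    for q in queries:
        # Each query scores against len(keys) // factor positions instead of len(keys).
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys_c]
        weights = softmax(scores)
        outputs.append(
            [sum(w * v[d] for w, v in zip(weights, values_c)) for d in range(dim_v)]
        )
    return outputs
```

With a sequence of length n and compression factor f, the score matrix shrinks from n × n to n × (n / f) entries, which is where the memory saving comes from.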