gitmyhub

LongNet

Python ★ 720 updated 2y ago

Implementation of plug-and-play attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
