gitmyhub

ByteTransformer

C++ ★ 478 updated 2y ago

optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052

No plain-English explanation yet — one is being written right now. Check back in a minute.