ByteTransformer
C++
★ 478
updated 2y ago
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052