
MGQA

Python ★ 16 updated 2y ago

An open-source implementation of multi-grouped query attention, based on the paper "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints".
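
For orientation, here is a minimal NumPy sketch of the grouped-query attention idea from the GQA paper, in which several query heads share a single key/value head. It is an illustration only and not this repository's API: the function name `grouped_query_attention` and the tensor layout `(heads, seq_len, head_dim)` are assumptions made for the example, and masking, batching, and learned projections are left out.

```python
import numpy as np


def grouped_query_attention(q, k, v):
    """Scaled dot-product attention where groups of query heads share a K/V head.

    Hypothetical helper for illustration (not taken from the MGQA repository).
    q:    (num_q_heads,  seq_len, head_dim)
    k, v: (num_kv_heads, seq_len, head_dim)
    """
    num_q_heads, _, head_dim = q.shape
    num_kv_heads = k.shape[0]
    if num_q_heads % num_kv_heads != 0:
        raise ValueError("num_q_heads must be divisible by num_kv_heads")
    group_size = num_q_heads // num_kv_heads

    # Repeat each K/V head so that `group_size` consecutive query heads share it.
    k = np.repeat(k, group_size, axis=0)
    v = np.repeat(v, group_size, axis=0)

    # Attention scores per head: (num_q_heads, seq_len, seq_len).
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(head_dim)

    # Numerically stable softmax over the key dimension.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)

    # Weighted sum of values: (num_q_heads, seq_len, head_dim).
    return weights @ v


rng = np.random.default_rng(0)
q = rng.standard_normal((8, 4, 16))  # 8 query heads
k = rng.standard_normal((2, 4, 16))  # 2 shared key heads
v = rng.standard_normal((2, 4, 16))  # 2 shared value heads
out = grouped_query_attention(q, k, v)
```

With one K/V head this reduces to multi-query attention, and with as many K/V heads as query heads it reduces to standard multi-head attention; the output always has the same shape as `q`.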
