gitmyhub

cost-optimal-gqa

Python · ★ 4 · updated 9mo ago

The code for the paper "Cost-Optimal Grouped-Query Attention for Long-Context Modeling"
