cost-optimal-gqa
Python
★ 4
updated 9mo ago
The code for the paper "Cost-Optimal Grouped-Query Attention for Long-Context Modeling"
No plain-English explanation yet — one is being written right now. Check back in a minute.