JaxTransformer
Python
★ 8
updated 2d ago
This repository demonstrates how to build a decoder-only Transformer with multi-query attention in JAX.
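
For orientation, here is a minimal sketch of what causal multi-query attention can look like in JAX. It is not taken from this repository: the function name `multi_query_attention`, the weight names (`w_q`, `w_k`, `w_v`, `w_o`), and the shapes are assumptions chosen for illustration. The key idea it shows is that every head gets its own query projection, while a single key projection and a single value projection are shared across all heads.

```python
import jax
import jax.numpy as jnp


def multi_query_attention(x, w_q, w_k, w_v, w_o, num_heads):
    """Causal multi-query self-attention for one sequence (illustrative sketch).

    x:   (seq_len, d_model)
    w_q: (d_model, num_heads * head_dim)  separate query projection per head
    w_k: (d_model, head_dim)              one key projection shared by all heads
    w_v: (d_model, head_dim)              one value projection shared by all heads
    w_o: (num_heads * head_dim, d_model)  output projection
    """
    seq_len = x.shape[0]
    head_dim = w_k.shape[1]

    q = (x @ w_q).reshape(seq_len, num_heads, head_dim)  # (T, H, D)
    k = x @ w_k  # (T, D), shared across heads
    v = x @ w_v  # (T, D), shared across heads

    # Scaled dot-product scores for every head against the shared keys.
    scores = jnp.einsum("thd,sd->hts", q, k) / jnp.sqrt(head_dim)  # (H, T, T)

    # Causal mask: position t may only attend to positions s <= t.
    causal = jnp.tril(jnp.ones((seq_len, seq_len), dtype=bool))
    scores = jnp.where(causal[None, :, :], scores, -jnp.inf)
    weights = jax.nn.softmax(scores, axis=-1)

    out = jnp.einsum("hts,sd->thd", weights, v)  # (T, H, D)
    return out.reshape(seq_len, num_heads * head_dim) @ w_o


# Hypothetical sizes and random weights, purely for demonstration.
seq_len, d_model, num_heads, head_dim = 6, 32, 4, 8
k_x, k_q, k_k, k_v, k_o = jax.random.split(jax.random.PRNGKey(0), 5)
x = jax.random.normal(k_x, (seq_len, d_model))
w_q = 0.1 * jax.random.normal(k_q, (d_model, num_heads * head_dim))
w_k = 0.1 * jax.random.normal(k_k, (d_model, head_dim))
w_v = 0.1 * jax.random.normal(k_v, (d_model, head_dim))
w_o = 0.1 * jax.random.normal(k_o, (num_heads * head_dim, d_model))

y = multi_query_attention(x, w_q, w_k, w_v, w_o, num_heads)
print(y.shape)
```

Compared with standard multi-head attention, the shared `w_k` and `w_v` shrink the key/value cache by a factor of `num_heads` during autoregressive decoding, which is the usual motivation for this variant.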