
JaxTransformer

Python ★ 8 updated 2d ago

This repository demonstrates how to build a Decoder-Only Transformer with Multi-Query Attention in JAX.
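To illustrate what multi-query attention means in a decoder-only setting, here is a minimal sketch in JAX. It is not taken from the repository: the function name `multi_query_attention`, the weight names, and the shapes are all assumptions chosen for illustration. The defining idea is that every head has its own query projection, while a single key projection and a single value projection are shared across all heads. A causal mask keeps each position from attending to later positions.

```python
import jax
import jax.numpy as jnp


def multi_query_attention(x, w_q, w_k, w_v, w_o, num_heads):
    """Causal multi-query attention for one sequence (illustrative sketch).

    x:   (seq_len, d_model)
    w_q: (d_model, num_heads * head_dim)  one query projection per head
    w_k: (d_model, head_dim)              single key projection shared by all heads
    w_v: (d_model, head_dim)              single value projection shared by all heads
    w_o: (num_heads * head_dim, d_model)  output projection
    """
    seq_len = x.shape[0]
    head_dim = w_k.shape[1]

    q = (x @ w_q).reshape(seq_len, num_heads, head_dim)  # (T, H, D)
    k = x @ w_k                                          # (T, D), shared
    v = x @ w_v                                          # (T, D), shared

    # Scaled dot-product scores for every head against the shared keys.
    scores = jnp.einsum("thd,sd->hts", q, k) / jnp.sqrt(head_dim)  # (H, T, T)

    # Causal mask: position t may only attend to positions s <= t.
    causal = jnp.tril(jnp.ones((seq_len, seq_len), dtype=bool))
    scores = jnp.where(causal[None, :, :], scores, jnp.finfo(scores.dtype).min)
    weights = jax.nn.softmax(scores, axis=-1)

    # Weighted sum of the shared values, then merge heads and project out.
    out = jnp.einsum("hts,sd->thd", weights, v)          # (T, H, D)
    return out.reshape(seq_len, num_heads * head_dim) @ w_o


# Hypothetical sizes and random weights, for demonstration only.
seq_len, d_model, num_heads, head_dim = 6, 32, 4, 8
k_x, k_q, k_k, k_v, k_o = jax.random.split(jax.random.PRNGKey(0), 5)
x = jax.random.normal(k_x, (seq_len, d_model))
w_q = jax.random.normal(k_q, (d_model, num_heads * head_dim)) * 0.1
w_k = jax.random.normal(k_k, (d_model, head_dim)) * 0.1
w_v = jax.random.normal(k_v, (d_model, head_dim)) * 0.1
w_o = jax.random.normal(k_o, (num_heads * head_dim, d_model)) * 0.1

y = multi_query_attention(x, w_q, w_k, w_v, w_o, num_heads)
```

Compared with standard multi-head attention, the key and value projections here are `num_heads` times smaller, which mainly reduces the size of the key/value cache during autoregressive decoding.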
