gpt_from_scratch
Jupyter Notebook
★ 1
updated 6mo ago
This notebook builds a complete GPT (Generative Pre-trained Transformer) model from scratch using PyTorch. It covers tokenization, self-attention, multi-head attention, transformer blocks, and text generation and all explained step-by-step with a simple nursery rhyme corpus.
No plain-English explanation yet — one is being written right now. Check back in a minute.