gitmyhub

gpt_from_scratch

Jupyter Notebook ★ 1 updated 6mo ago

This notebook builds a complete GPT (Generative Pre-trained Transformer) model from scratch using PyTorch. It covers tokenization, self-attention, multi-head attention, transformer blocks, and text generation and all explained step-by-step with a simple nursery rhyme corpus.

No plain-English explanation yet — one is being written right now. Check back in a minute.