gitmyhub

gradient-accumulation-blog

Python ★ 32 updated 3y ago

Finetuning BLOOM on a single GPU using gradient-accumulation

No plain-English explanation yet — one is being written right now. Check back in a minute.