nanotron
Python
★ 2.7k
updated 25d ago
Minimalistic large language model 3D-parallelism training