Sparse-GPT-Pretraining
Python
★ 24
updated 4mo ago
A codebase for pretraining multi-billion-scale sparse GPTs.
No plain-English explanation yet — one is being written right now. Check back in a minute.
A codebase for pretraining multi-billion-scale sparse GPTs.
No plain-English explanation yet — one is being written right now. Check back in a minute.