gitmyhub

minimind

★ 0 updated 3d ago ⑂ fork

🧠 "Large model": train a small 64M-parameter LLM completely from scratch in just 2 hours!
