minimind
★ 0
updated 3d ago
⑂ fork
🧠「大模型」2小时完全从0训练64M的小参数LLM!Train a 64M-parameter LLM from scratch in just 2h!
No plain-English explanation yet — one is being written right now. Check back in a minute.
🧠「大模型」2小时完全从0训练64M的小参数LLM!Train a 64M-parameter LLM from scratch in just 2h!
No plain-English explanation yet — one is being written right now. Check back in a minute.