transformer-smaller-training-vocab
★ 1
updated 1y ago
Temporarily remove unused tokens during training to save RAM and speed up training.
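The idea can be illustrated with a minimal, framework-free sketch. This is not the repository's actual API: every function name below (`build_reduced_vocab`, `shrink_embeddings`, `remap`, `restore_embeddings`) is hypothetical, and plain Python lists stand in for a real embedding matrix. It assumes the training data is already tokenized into integer ids.

```python
# Hypothetical sketch: shrink the vocabulary to the tokens that actually occur
# in the training data, train on the small embedding table, then write the
# trained rows back into the full table. Names are illustrative only.

def build_reduced_vocab(token_id_batches, always_keep=()):
    """Collect the token ids that occur in the data (plus any forced ids)."""
    used = set(always_keep)
    for batch in token_id_batches:
        used.update(batch)
    kept_ids = sorted(used)
    # Map each original id to its position in the reduced table.
    old_to_new = {old: new for new, old in enumerate(kept_ids)}
    return kept_ids, old_to_new

def shrink_embeddings(full_embeddings, kept_ids):
    """Copy only the rows of the kept tokens into a smaller table."""
    return [list(full_embeddings[i]) for i in kept_ids]

def remap(batch, old_to_new):
    """Translate original token ids into reduced-vocabulary ids."""
    return [old_to_new[t] for t in batch]

def restore_embeddings(full_embeddings, small_embeddings, kept_ids):
    """Write trained rows back; rows of unused tokens stay untouched."""
    for new_id, old_id in enumerate(kept_ids):
        full_embeddings[old_id] = list(small_embeddings[new_id])
    return full_embeddings

# Toy data: a 10-token vocabulary with 2-dimensional embeddings.
full = [[float(i), float(i)] for i in range(10)]
batches = [[3, 7, 3], [7, 9]]

kept_ids, old_to_new = build_reduced_vocab(batches, always_keep=(0,))
small = shrink_embeddings(full, kept_ids)
remapped = [remap(b, old_to_new) for b in batches]

# Stand-in for a training update to token 7's embedding.
small[old_to_new[7]] = [70.0, 70.0]

restored = restore_embeddings(full, small, kept_ids)
```

In this toy case only 4 of the 10 embedding rows are held during training, which is where the memory and speed savings would come from. A real model would also need the same treatment for the output layer and the tokenizer.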