gitmyhub

dolma

Python ★ 1.5k updated 7mo ago

Data and tools for generating and inspecting OLMo pre-training data.

No plain-English explanation yet — one is being written right now. Check back in a minute.