gitmyhub

pile-pubmedcentral

Python ★ 26 updated 5y ago

A script for collecting the PubMed Central dataset in a language-modelling-friendly format.
