gitmyhub

petastorm

Python ★ 1.9k updated 5mo ago

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

No plain-English explanation yet — one is being written right now. Check back in a minute.