gitmyhub

spark

Scala ★ 0 updated 2y ago ⑂ fork

This is OpenMLDB's Spark Distribution, which is particularly optimized for feature extraction. It includes a few novel techniques, such as native implementation of last join and multi-window parallelization. Its APIs are fully compatible with the standard Spark. It is designed to be a component of OpenMLDB (https://github.com/4paradigm/OpenMLDB).

No plain-English explanation yet — one is being written right now. Check back in a minute.