wit
★ 0
updated 4y ago
⑂ fork
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
No plain-English explanation yet — one is being written right now. Check back in a minute.