X-CLIP
★ 0
updated 3y ago
An official implementation of "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"