gitmyhub

TokLIP

Python ★ 236 updated 10mo ago

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

No plain-English explanation yet — one is being written right now. Check back in a minute.