VQA-With-Multimodal-Transformers
Jupyter Notebook
★ 37
updated 4y ago
Exploring multimodal fusion-type transformer models for visual question answering (on DAQUAR dataset)
No plain-English explanation yet — one is being written right now. Check back in a minute.