gitmyhub

VILA-1

★ 0 updated 2y ago ⑂ fork

VILA - A multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

No plain-English explanation yet — one is being written right now. Check back in a minute.