gitmyhub

awesome-omni-modal-papers

★ 7 updated 1y ago

An awesome list of omni-modality LLM models that can perceive and generate images, videos, audios, and more all at once

No plain-English explanation yet — one is being written right now. Check back in a minute.