Qwen2.5-Omni
Jupyter Notebook
★ 4.0k
updated 1y ago
Qwen2.5-Omni is an end-to-end multimodal model from the Qwen team at Alibaba Cloud. It understands text, audio, image, and video input and generates speech in real time.