Foley-Omni
Foley-Omni: a unified multimodal audio generation model for task-level synthesis and complete video soundtrack generation, producing speech, sound effects, and music conditioned on text and video.
An AI research tool from Nanjing University that generates synchronized soundtracks (speech, sound effects, and music together) for silent video clips, guided by a text prompt describing the desired audio.
Python
★ 22
14d ago