NJU- Speech ORG

1 repos
10 followers
0 following

Python 100%

All public repos (1)

Show forks Show archived

Foley-Omni

Foley-Omni: a unified multimodal audio generation model for task-level synthesis and complete video soundtrack generation, producing speech, sound effects, and music conditioned on text and video.

AI research tool from Nanjing University that generates synchronized soundtracks (speech, sound effects, and music together) for silent video clips using a text prompt to describe what you want to hear.

Python ★ 22 14d ago
Explain →