-
mustango
Mustango: Toward Controllable Text-to-Music Generation
Python ★ 392 1y agoExplain → -
Video2Music
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
Python ★ 194 1y agoExplain → -
SonicMaster
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
Python ★ 181 14d agoExplain → -
Text2midi
Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2midi allows users to create symbolic music that aligns with detailed textual prompts, including musical attributes like chords, tempo, and style.
Python ★ 171 1y agoExplain → -
MidiCaps
A large-scale dataset of caption-annotated MIDI files.
Python ★ 84 1y agoExplain → -
awesome-MER
A curated list of Datasets, Models and Papers for Music Emotion Recognition (MER)
★ 78 1y agoExplain → -
Music2Emotion
Music2Emo: Towards Unified Music Emotion Recognition across Dimensional and Categorical Models
Python ★ 54 9mo agoExplain → -
SonicVerse
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
Python ★ 53 10mo agoExplain → -
JamendoMaxCaps
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
Python ★ 51 1y agoExplain → -
mirflex
Music Information Retrieval Feature Library for Extraction
Python ★ 48 1y agoExplain → -
nnAudio2
No description.
Jupyter Notebook ★ 34 11d agoExplain → -
MelodySim
MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection
Python ★ 29 1y agoExplain → -
MERIT
No description.
Python ★ 26 16d agoExplain → -
t2m-inferalign
Improving Symbolic Music Generation with Inference-Time Alignment
Python ★ 22 10mo agoExplain → -
MuVi
Predicting emotion from music videos: exploring the relative contribution of visual and auditory information on affective responses
Python ★ 22 2y agoExplain → -
DART
Demo for DART, Audio Imagination workshop submission in NeurIPS 2024
Python ★ 15 1mo agoExplain → -
ai-audio-datasets-list ⑂
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
★ 13 2y agoExplain → -
PreBit
This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bitcoin"
Jupyter Notebook ★ 12 10mo agoExplain → -
megamusicaps
No description.
Python ★ 11 1y agoExplain → -
Karma-MV
A Benchmark for Causal Question Answering on Music Videos
Python ★ 10 28d agoExplain → -
cross-dataset-emotion-alignment
code for Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction
Python ★ 10 1y agoExplain → -
apex
APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music
Python ★ 9 1mo agoExplain → -
MineROI-Net
Smart Timing for Mining: A Deep Learning Framework for Bitcoin Hardware ROI prediction
Python ★ 9 1mo agoExplain → -
Accented-TTS-MLVAE-ADV
No description.
Python ★ 8 2y agoExplain → -
DisfluencySpeech
Resources for DisfluencySpeech
★ 8 1y agoExplain → -
calm-me-down
No description.
TypeScript ★ 3 1d agoExplain → -
relational-ai
Scaffolded Vulnerability: Chatbot-Mediated Reciprocal Self-Disclosure and Need-Supportive Interaction in Couples
Python ★ 2 1mo agoExplain → -
CM-HRNN ⑂
Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure
Python ★ 2 2y agoExplain → -
genmusic_demo_list ⑂
a list of demo websites for automatic music generation research
★ 2 2y agoExplain → -
multimodal-generative-ai-course ⑂
Multimodal Generative AI course
★ 1 2mo agoExplain → -
nnAudio ⑂
Audio processing by using pytorch 1D convolution network
Python ★ 1 2mo agoExplain → -
EAIM2026
1 st Workshop on Emerging AI Technologies for Music, part of AAAI
CSS ★ 1 1mo agoExplain → -
AMAAI-Lab.github.io ⑂
AMAAI Lab website
HTML ★ 1 7mo agoExplain → -
to-embody-or-not
Repo for paper: To Embody or Not: The Effect Of Embodiment On User Perception Of LLM-based Conversational Agents
Python ★ 1 1y agoExplain → -
emotionweb
Website emotion guidance
JavaScript ★ 1 2y agoExplain → -
IAMM
An exploration of how generative text-to-music AI models can be used for emotion guidance
★ 1 1y agoExplain → -
survey-music-nlp ⑂
Repository for "Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval Systems: a Survey"
★ 1 1y agoExplain → -
kylo-ren-app ⑂
Web interface for AI music generation models
JavaScript ★ 1 2y agoExplain → -
Audio-Music-AI-Research-Resources
No description.
★ 1 1y agoExplain → -
CVAE-Tacotron ⑂
Conditional VAE for Accented Speech Generation
★ 1 2y agoExplain → -
singapore-music-classifier
Code for paper A dataset and classification model for Malay, Hindi, Tamil and Chinese music
Jupyter Notebook ★ 1 2y agoExplain → -
examples_sonic
No description.
★ 0 6mo agoExplain → -
BandCondiNet ⑂
BandCondiNet: Parallel Transformers-based Conditional Popular Music Generation with Multi-View Features
★ 0 8mo agoExplain → -
MERP ⑂
Dataset and benchmark models for emotion prediction of music with profile info
★ 0 3y agoExplain → -
midi-miner ⑂
Python MIDI track classifier and tonal tension calculation based on spiral array theory
★ 0 2y agoExplain → -
FundamentalMusicEmbedding ⑂
Fundamental Music Embedding, FME
★ 0 2y agoExplain → -
Conditional-Drums-Generation-using-Compound-Word-Representations ⑂
Conditional Drums Generation using Compound Word Representations
★ 0 3y agoExplain → -
VAE-for-expressive-piano ⑂
Code accompanying ML4MD ICML 2020 paper - "Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance".
★ 0 5y agoExplain → -
Revisiting-the-Onsets-and-Frames-Model-with-Additive-Attention ⑂
Revisiting the Onsets and Frames Model with Additive Attention
★ 0 5y agoExplain → -
IJCNN2020_music_emotion ⑂
Regression-based Music Emotion Prediction using Triplet Neural Networks
★ 0 3y agoExplain → -
demucs_lightning ⑂
Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features
★ 0 3y agoExplain → -
ReconVAT ⑂
ReconVAT: a semi-supervised automatic music transcription (AMT) model
★ 0 2y agoExplain → -
Jointist ⑂
Official Implementation of Jointist
★ 0 2y agoExplain → -
AudioLoader ⑂
PyTorch Dataset for Speech and Music audio
★ 0 2y agoExplain → -
DiffRoll ⑂
PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model
★ 0 3y agoExplain → -
colab_tension_vae ⑂
A variational autoencoder for music generation with tension control
★ 0 5y agoExplain → -
LeadSheetGen_Valence ⑂
A novel seq2seq framework where high-level musicalities (such us the valence of the chord progression) are fed to the Encoder, and they are "translated" to lead sheet events in the Decoder. For further details please read and cite our paper:
Python ★ 0 3y agoExplain → -
MusIAC ⑂
music inpainting control
★ 0 4y agoExplain → -
AMAAI-guidebook
No description.
TeX ★ 0 4y agoExplain → -
HEAR_2021_NeurIPS_Challenge_SUTD_AMAAI
No description.
★ 0 4y agoExplain → -
datasets_emotion ⑂
This repository collects information about different data sets for Music Emotion Recognition.
★ 0 4y agoExplain → -
music-fader-nets ⑂
Code accompanying ISMIR 2020 paper - "Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature Modelling".
★ 0 5y agoExplain →
No repos match these filters.