AMAAI Lab ORG

@AMAAI-Lab ·Singapore ·dorienherremans.com

The Audio, Music, and AI Lab at Singapore University of Technology and Design (SUTD)

62 repos
128 followers
0 following

Python 76%
Jupyter Notebook 10%
TypeScript 3%
CSS 3%
JavaScript 3%

All public repos (62)

Show forks Show archived

mustango

Mustango: Toward Controllable Text-to-Music Generation

Python ★ 392 1y ago
Explain →
Video2Music

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

Python ★ 194 1y ago
Explain →
SonicMaster

SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering

Python ★ 181 14d ago
Explain →
Text2midi

Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2midi allows users to create symbolic music that aligns with detailed textual prompts, including musical attributes like chords, tempo, and style.

Python ★ 171 1y ago
Explain →
MidiCaps

A large-scale dataset of caption-annotated MIDI files.

Python ★ 84 1y ago
Explain →
awesome-MER

A curated list of Datasets, Models and Papers for Music Emotion Recognition (MER)

★ 78 1y ago
Explain →
Music2Emotion

Music2Emo: Towards Unified Music Emotion Recognition across Dimensional and Categorical Models

Python ★ 54 9mo ago
Explain →
SonicVerse

SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning

Python ★ 53 10mo ago
Explain →
JamendoMaxCaps

JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks

Python ★ 51 1y ago
Explain →
mirflex

Music Information Retrieval Feature Library for Extraction

Python ★ 48 1y ago
Explain →
nnAudio2

No description.

Jupyter Notebook ★ 34 11d ago
Explain →
MelodySim

MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection

Python ★ 29 1y ago
Explain →
MERIT

No description.

A Python tool that compares two audio files across melody, rhythm, and timbre separately, returning three independent similarity scores instead of a single blended number.

Python ★ 26 16d ago
Explain →
t2m-inferalign

Improving Symbolic Music Generation with Inference-Time Alignment

Python ★ 22 10mo ago
Explain →
MuVi

Predicting emotion from music videos: exploring the relative contribution of visual and auditory information on affective responses

Python ★ 22 2y ago
Explain →
DART

Demo for DART, Audio Imagination workshop submission in NeurIPS 2024

Python ★ 15 1mo ago
Explain →
ai-audio-datasets-list ⑂

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.

★ 13 2y ago
Explain →
PreBit

This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bitcoin"

Jupyter Notebook ★ 12 10mo ago
Explain →
megamusicaps

No description.

Python ★ 11 1y ago
Explain →
Karma-MV

A Benchmark for Causal Question Answering on Music Videos

Python ★ 10 28d ago
Explain →
cross-dataset-emotion-alignment

code for Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction

Python ★ 10 1y ago
Explain →
apex

APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music

Python ★ 9 1mo ago
Explain →
MineROI-Net

Smart Timing for Mining: A Deep Learning Framework for Bitcoin Hardware ROI prediction

Python ★ 9 1mo ago
Explain →
Accented-TTS-MLVAE-ADV

No description.

Python ★ 8 2y ago
Explain →
DisfluencySpeech

Resources for DisfluencySpeech

★ 8 1y ago
Explain →
calm-me-down

No description.

TypeScript ★ 3 1d ago
Explain →
relational-ai

Scaffolded Vulnerability: Chatbot-Mediated Reciprocal Self-Disclosure and Need-Supportive Interaction in Couples

Python ★ 2 1mo ago
Explain →
CM-HRNN ⑂

Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure

Python ★ 2 2y ago
Explain →
genmusic_demo_list ⑂

a list of demo websites for automatic music generation research

★ 2 2y ago
Explain →
multimodal-generative-ai-course ⑂

Multimodal Generative AI course

★ 1 2mo ago
Explain →
nnAudio ⑂

Audio processing by using pytorch 1D convolution network

Python ★ 1 2mo ago
Explain →
EAIM2026

1 st Workshop on Emerging AI Technologies for Music, part of AAAI

CSS ★ 1 1mo ago
Explain →
AMAAI-Lab.github.io ⑂

AMAAI Lab website

HTML ★ 1 7mo ago
Explain →
to-embody-or-not

Repo for paper: To Embody or Not: The Effect Of Embodiment On User Perception Of LLM-based Conversational Agents

Python ★ 1 1y ago
Explain →
emotionweb

Website emotion guidance

JavaScript ★ 1 2y ago
Explain →
IAMM

An exploration of how generative text-to-music AI models can be used for emotion guidance

★ 1 1y ago
Explain →
survey-music-nlp ⑂

Repository for "Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval Systems: a Survey"

★ 1 1y ago
Explain →
kylo-ren-app ⑂

Web interface for AI music generation models

JavaScript ★ 1 2y ago
Explain →
Audio-Music-AI-Research-Resources

No description.

★ 1 1y ago
Explain →
CVAE-Tacotron ⑂

Conditional VAE for Accented Speech Generation

★ 1 2y ago
Explain →
singapore-music-classifier

Code for paper A dataset and classification model for Malay, Hindi, Tamil and Chinese music

Jupyter Notebook ★ 1 2y ago
Explain →
examples_sonic

No description.

★ 0 6mo ago
Explain →
BandCondiNet ⑂

BandCondiNet: Parallel Transformers-based Conditional Popular Music Generation with Multi-View Features

★ 0 8mo ago
Explain →
MERP ⑂

Dataset and benchmark models for emotion prediction of music with profile info

★ 0 3y ago
Explain →
midi-miner ⑂

Python MIDI track classifier and tonal tension calculation based on spiral array theory

★ 0 2y ago
Explain →
FundamentalMusicEmbedding ⑂

Fundamental Music Embedding, FME

★ 0 2y ago
Explain →
Conditional-Drums-Generation-using-Compound-Word-Representations ⑂

Conditional Drums Generation using Compound Word Representations

★ 0 3y ago
Explain →
VAE-for-expressive-piano ⑂

Code accompanying ML4MD ICML 2020 paper - "Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance".

★ 0 5y ago
Explain →
Revisiting-the-Onsets-and-Frames-Model-with-Additive-Attention ⑂

Revisiting the Onsets and Frames Model with Additive Attention

★ 0 5y ago
Explain →
IJCNN2020_music_emotion ⑂

Regression-based Music Emotion Prediction using Triplet Neural Networks

★ 0 3y ago
Explain →
demucs_lightning ⑂

Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features

★ 0 3y ago
Explain →
ReconVAT ⑂

ReconVAT: a semi-supervised automatic music transcription (AMT) model

★ 0 2y ago
Explain →
Jointist ⑂

Official Implementation of Jointist

★ 0 2y ago
Explain →
AudioLoader ⑂

PyTorch Dataset for Speech and Music audio

★ 0 2y ago
Explain →
DiffRoll ⑂

PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model

★ 0 3y ago
Explain →
colab_tension_vae ⑂

A variational autoencoder for music generation with tension control

★ 0 5y ago
Explain →
LeadSheetGen_Valence ⑂

A novel seq2seq framework where high-level musicalities (such us the valence of the chord progression) are fed to the Encoder, and they are "translated" to lead sheet events in the Decoder. For further details please read and cite our paper:

Python ★ 0 3y ago
Explain →
MusIAC ⑂

music inpainting control

★ 0 4y ago
Explain →
AMAAI-guidebook

No description.

TeX ★ 0 4y ago
Explain →
HEAR_2021_NeurIPS_Challenge_SUTD_AMAAI

No description.

★ 0 4y ago
Explain →
datasets_emotion ⑂

This repository collects information about different data sets for Music Emotion Recognition.

★ 0 4y ago
Explain →
music-fader-nets ⑂

Code accompanying ISMIR 2020 paper - "Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature Modelling".

★ 0 5y ago
Explain →