Repository navigation

#

music-generation

open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Python
8952
8 天前

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python
4824
13 天前

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python
1060
4 天前

🧠+🎧 Build your music algorithms and AI models with the next-gen DAW 🔥

Python
827
2 年前

MIDI / symbolic music tokenizers for Deep Learning models 🎶

Python
762
24 天前

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook
756
7 个月前

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

718
2 个月前

a list of demo websites for automatic music generation research

692
4 天前

"Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions", ACM Multimedia 2020

Python
570
2 年前

Train an LSTM to generate piano or violin/piano music.

Python
568
4 年前
Zig
566
1 个月前

Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.

Python
541
2 年前
Python
475
1 个月前

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Python
455
3 个月前

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).

426
3 年前