music-generation

open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Python
9420
4 months ago

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python
5554
4 months ago

A fundamental toolkit designed for music, song, and audio generation

Python
1209
4 months ago

🧠+🎧 Build your music algorithms and AI models with the next-gen DAW 🔥

Python
844
2 years ago

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

837
3 months ago

MIDI / symbolic music tokenizers for Deep Learning models 🎶

Python
798
1 month ago

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook
774
1 year ago

A list of demo websites for automatic music generation research

727
1 month ago

Zig
605
1 month ago

"Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions", ACM Multimedia 2020

Python
585
3 years ago

Train an LSTM to generate piano or violin/piano music.

Python
566
5 years ago

Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.

Python
548
2 years ago

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Python
504
5 months ago

Python
490
1 month ago

A paper and project list covering cutting-edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related work (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR, etc.).

452
3 years ago