music-generation

open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Python
9302
3 months ago

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python
5400
3 months ago

A fundamental toolkit designed for music, song, and audio generation

Python
1175
3 months ago

🧠+🎧 Build your music algorithms and AI models with the next-gen DAW 🔥

Python
845
2 years ago

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

809
1 month ago

MIDI / symbolic music tokenizers for Deep Learning models 🎶

Python
793
2 days ago

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook
770
1 year ago

A list of demo websites for automatic music generation research

717
7 days ago
Zig
595
7 days ago

"Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions", ACM Multimedia 2020

Python
580
3 years ago

Train an LSTM to generate piano or violin/piano music.

Python
571
5 years ago

Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.

Python
549
2 years ago

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Python
499
3 months ago
Python
490
2 days ago

A paper and project list about cutting-edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR, etc.).

443
3 years ago