Repository navigation
audio-synthesis
- Website
- Wikipedia
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Official PyTorch implementation of BigVGAN (ICLR 2023)
Deep Convolutional Neural Networks for Musical Source Separation
A soundfont editor for quickly designing musical instruments.
openFrameworks addon for audio synthesis and generative music
Pytorch implementation of BigVSAN
Library for pure Rust advanced audio synthesis.
PC Music Generator - a Virtual Modular Synthesizer
A python toolkit for automatic audio/MIDI rendering using REAPER
Pythonic audio processing and generation framework
jazznet dataset of piano patterns for music audio machine learning research
Interactive audio in Jupyter
A creative coding library.
Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the YourTTS TTS model to clone and generate realistic audio waves
Text prompt steered synthetic audio generators
Wavetable creation and manipulation tool
Deep Performer: Score-to-audio music performance synthesis
Really-Real Time FM Tone Transfer Audio Pluigin