Repository navigation
audioldm
- Website
- Wikipedia
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
A webui for different audio related Neural Networks
OpenMusic: SOTA Text-to-music (TTM) Generation
(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on 3 languages
Text prompt steered synthetic audio generators
Code for Investigating Personalization Methods in Text to Music Generation
AudioLDM text to audio colab
A comprehensive, click to install, fully open-source, Video + Audio Generation AIO Toolkit using advanced prompt engineering plus the power of CogVideox + AudioLDM2 + Python!
Generative AI version of the GeoGuesser game.
Workshop for Multimodale media generator