Repository navigation

#

audioldm

open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Python
9420
4 个月前

(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on 3 languages

Python
103
15 天前

[ICASSP'24] Investigating Personalization Methods in Text to Music Generation

Python
41
2 年前

A comprehensive, click to install, fully open-source, Video + Audio Generation AIO Toolkit using advanced prompt engineering plus the power of CogVideox + AudioLDM2 + Python!

Python
20
9 个月前

AudioLDM text to audio colab

Jupyter Notebook
19
2 年前

Simple web UI for AudioLDM 2.

Python
1
2 年前

Enhancing Diffusion-Based Music Generation Performance with LoRA.

Python
1
1 个月前

In this game, your given an image for so many seconds to view. Then you have to guess just by clicking on any point in the world that the photo was taken. NOTICE: This game is INCOMPLETE

JavaScript
0
1 个月前