Repository navigation

#

audio-generation

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

Go
31887
38 分钟前
Python
13160
7 小时前
open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Python
8950
7 天前

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python
4821
12 天前

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python
2617
4 个月前

Text-to-Audio/Music Generation

Python
2405
7 个月前
rsxdalv/tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)

TypeScript
2110
10 小时前

Audio generation using diffusion models, in PyTorch.

Python
2035
2 年前

A timeline of the latest AI models for audio generation, starting in 2023!

1898
1 年前

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python
1490
6 个月前
declare-lab/tango
Python
1160
4 个月前

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python
1059
3 天前
Python
1001
7 个月前

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

718
2 个月前

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Python
419
10 个月前

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Python
396
1 年前

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

379
17 小时前
Jupyter Notebook
360
9 个月前

This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.

354
7 个月前
Python
279
9 个月前