tts-api

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

Python
3488
21 days ago
shivammehta25/Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook
1096
8 days ago

A simple VITS HTTP API, developed by extending Moegoe with additional features.

Python
1007
1 month ago

Free online text-to-speech API

TypeScript
955
1 day ago

A simple FastAPI Server to run XTTSv2

Python
534
1 year ago

Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.

Python
451
1 month ago

A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files.

JavaScript
393
9 days ago

Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and GPU/CPU execution.

Python
307
3 months ago

TTS: a text-to-speech front end compatible with OpenAI, EdgeTTS, and other interfaces

JavaScript
266
2 months ago

Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiobooks, and GPU acceleration.

Python
182
13 days ago

openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a JavaScript framework based on Vue.js.

JavaScript
162
2 years ago

NoneBot DeepSeek plugin. Integrates the DeepSeek model to provide intelligent conversation and Q&A features.

Python
159
12 days ago

🌻 VITS ONNX TTS server designed for fast inference 🔥

Python
130
7 months ago

AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine

Python
123
2 days ago

Streaming TTS based on Piper with optional RK3588 NPU support

C++
103
4 months ago

Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate natural-sounding audio from text. Built with modern web technologies for an intuitive user experience, including customizable voice and speech speed settings, and the ability to download audio files directly.

JavaScript
86
9 months ago

An AI-powered chatbot integrated with Telegram, using OpenAI GPT-3.5 Turbo, language embeddings, and FAISS for similarity search to provide more contextually relevant responses to user queries

Python
77
2 years ago

Simple Python script to interact with the TikTok TTS Voices.

Python
71
1 year ago

CapCut TTS wrapper API

TypeScript
71
1 year ago