Repository navigation

#

tts-api

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

Python
3752
2 天前
shivammehta25/Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook
1135
12 天前

A simple VITS HTTP API, developed by extending Moegoe with additional features.

Python
1014
3 个月前

免费的在线文本转语音API

TypeScript
981
1 个月前

Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.

Python
556
3 个月前

A simple FastAPI Server to run XTTSv2

Python
543
1 年前

A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files.

JavaScript
391
21 天前

Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and GPU/CPU execution.

Python
324
4 个月前

TTS-文本转语音/文本转语音前端,兼容OpenAI、EdgeTTS等接口

JavaScript
297
4 个月前

Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiobooks, and GPU acceleration.

Python
202
2 个月前

openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a Javascript framework based on Vue.js.

JavaScript
162
2 年前

NoneBot DeepSeek 插件。接入 DeepSeek 模型,提供智能对话与问答功能

Python
160
1 个月前

AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine

Python
130
22 天前

🌻 VITS ONNX TTS server designed for fast inference 🔥

Python
128
8 个月前

Streaming TTS based on Piper with optional RK3588 NPU support

C++
107
5 个月前

Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate natural-sounding audio from text. Built with modern web technologies for an intuitive user experience, including customizable voice and speech speed settings, and the ability to download audio files directly.

JavaScript
88
1 年前

An AI-powered chatbot integrated with Telegram, using OpenAI GPT-3.5 Turbo, language embeddings, and FAISS for similarity search to provide more contextually relevant responses to user queries

Python
78
2 年前

A Non-Official ElevenLabs RESTful API Client for dotnet

C#
77
1 个月前

Simple Python script to interact with the TikTok TTS Voices.

Python
74
1 年前

CapCut TTS rapper API

TypeScript
71
1 年前