Repository navigation

#

voice-assistant

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

Python
8440
4 个月前

Open Source framework for voice and multimodal conversational AI

Python
5662
5 小时前

TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze.

Python
5633
1 天前

Voice assistant made as an experiment using neural networks for things like STT/TTS/Wake Word/NLU etc.

Rust
2337
1 年前
alan-ai/alan-sdk-ios

Conversational AI SDK for iOS to enable text and voice conversations with actions (Swift, Objective-C)

Objective-C
1908
10 个月前
alan-ai/alan-sdk-android

Conversational AI SDK for Android to enable text and voice conversations with actions (Java, Kotlin)

1836
10 个月前

Conversational AI SDK for Flutter to enable text and voice conversations with actions (iOS and Android)

Ruby
1796
1 年前

🔈 The React for Voice and Chat: Build Apps for Alexa, Messenger, Instagram, the Web, and more

TypeScript
1679
10 个月前

Conversational AI SDK for Ionic to enable text and voice conversations with actions (React, Angular, Vue)

TypeScript
1678
1 年前
alan-ai/alan-sdk-cordova

Conversational AI SDK for Apache Cordova to enable text and voice conversations with actions (iOS and Android)

Ruby
1153
1 年前

百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断

Python
1145
1 个月前

Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.

Python
896
1 个月前