ChatGH
CollectionsRankings

Collections

  • 人工智能
  • 应用开发
  • 区块链生态
  • 数据科学
  • 数据库
  • 开发者工具
  • DevOps
  • 游戏开发
  • 物联网 (IoT)
  • 学习资源
  • 媒体与流媒体
  • 中间件
  • 网络
  • 操作系统
  • 搜索引擎
  • 安全
  • 存储系统
  • 系统实用工具
  • Web 开发
  • 网页抓取
Collections
人工智能
语音与音频

语音与音频

语音识别、语音合成及音频处理框架。

Repositories

openai

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python
95.1k
ggml-org/whisper.cpp
ggml-org

ggml-org / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C++
47.1k
RVC-Boss

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python
55.4k
CorentinJ/Real-Time-Voice-Cloning
CorentinJ

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python
59.4k
coqui-ai

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python
44.7k
mozilla

mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++
26.7k