voice-synthesis

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python text-to-speech deep-learning speech PyTorch tts vocoder tacotron glow-tts melgan speaker-encoder hifigan speaker-encodings multi-speaker-tts tts-model speech-synthesis voice-cloning voice-synthesis voice-conversion

Python

42854

5661

1 year ago

denizsafak / abogen

Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

audiobook audiobooks content-creation content-creator epub-converter kokoro media-generation narrator speech-synthesis subtitles text-to-audio text-to-speech tts voice-synthesis kokoro-82m kokoro-tts

Python

3660

201

17 days ago

jim-schwoebel / voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

datasets dataset voice data voice-control voice-synthesis voice-commands voice-assistant voice-recognition voice-chat voice-activity-detection voice-conversion noise

2033

250

1 year ago

panyanyany / Twocast

AI podcast generator for bilingual episodes in multiple languages, an alternative to NotebookLLM; generates realistic, human-like conversational AI podcasts with multilingual and multi-voice support

podcast podcast-generator voice-cloning voice-synthesis

TypeScript

1056

3 months ago

DanRuta / xVA-Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

voice-synthesis tacotron machine-learning Electron skyrim fallout speech-synthesis

JavaScript

631

1 year ago

SforAiDl / Neural-Voice-Cloning-With-Few-Samples

An implementation of "Neural Voice Cloning With Few Samples"

voice-cloning voice-synthesis deep-learning speaker-adaptation tts speech-processing speaker-encodings voice

Python

436

122

5 years ago

ManimCommunity / manim-voiceover

Manim plugin for all things voiceover

text-to-speech manim tts artificial-intelligence speech-synthesis voice-synthesis

Python

248

8 months ago

hujinsen / pytorch-StarGAN-VC

A full reproduction of the StarGAN-VC paper, with stable training and better audio quality.

voice-conversion voice-converter stargan PyTorch pytorch-implementation voice-synthesis

Python

247

2 years ago

ZDisket / TensorVox

Desktop application for neural speech synthesis written in C++

fastspeech2 voice-synthesis tts Desktop tacotron2 speech-synthesis text-to-speech real-time

C++

213

3 years ago

zakaton / Pink-Trombone

A programmable version of Neil Thapen's Pink Trombone

speech-synthesis web-audio API Web Components voice voice-synthesis

JavaScript

185

9 months ago

anhnh2002 / XTTSv2-Finetuning-for-New-Languages

speech-synthesis text-to-speech voice-cloning voice-synthesis

Python

170

10 months ago

smoke-trees / Voice-synthesis

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real time. SV2TTS is a three-stage deep learning framework that makes it possible to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.

voice-synthesis voice-cloning pytorch-implementation Tensorflow Keras speech-to-text

Python

169

5 years ago

JollyToday / GhostCut-auto_video_translation

Automatic video translation and dubbing: translates hard-coded subtitles and re-inserts them with the original styling, and removes subtitles or any other on-screen text from video.

inpainting subtitles voice-synthesis Material Design video-api video-subtitles video-translation FFmpeg moviepy tts

Python

160

2 years ago

nipponjo / tts-arabic-pytorch

TTS models for Arabic (Tacotron2, FastPitch)

arabic hifi-gan PyTorch tacotron2 text-to-speech tts Python deep-learning speech speech-synthesis hifigan multi-speaker-tts tts-model voice-synthesis

Jupyter Notebook

122

1 year ago

Azure-Samples / Cognitive-Services-Voice-Assistant

Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building a client application for your bot or Custom Command service. You will also be able to easily deploy a working Custom Command-based Voice Assistant to your own Azure subscription.

Bot botframework bot-framework SDK .NET WPF microsoft-cognitive-services microsoft-bot-framework Microsoft speech-recognition speech-to-text voice-assistant voice-control voice-commands voice-synthesis

C++

121

103

2 years ago

RageAgainstThePixel / com.rest.elevenlabs

A non-official Eleven Labs voice synthesis client for Unity (UPM)

artificial-intelligence Unity machine-learning tts upm voice-synthesis

101

1 month ago

sidmulajkar / sentiment-predictor-for-stress-detection

Voice stress analysis (VSA) aims to differentiate between stressed and non-stressed outputs in response to stimuli (e.g., questions posed), with high stress seen as an indication of deception. In this work, we propose a deep learning-based psychological stress detection model using speech signals. With increasing demands for communication between humans and intelligent systems, automatic stress detection is becoming an interesting research topic. Stress can be reliably detected by measuring the level of specific hormones (e.g., cortisol), but this is not a convenient method for the detection of stress in human-machine interactions. The proposed algorithm first extracts Mel-filter bank coefficients using pre-processed speech data and then predicts the status of stress output using a binary decision criterion (i.e., stressed or unstressed) using CNN (Convolutional Neural Network) and dense fully connected layer networks.

deception voice convolutional-neural-network emotion-recognition emotion emotion-detection voice-synthesis stress-testing deep-learning

Jupyter Notebook

4 years ago

RageAgainstThePixel / ElevenLabs-DotNet

A Non-Official ElevenLabs RESTful API Client for dotnet

artificial-intelligence machine-learning tts tts-api voice-synthesis .NET speech-synthesis

1 month ago