Repository navigation

#

whisperx

abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Python
4840
2 个月前
Python
2689
10 个月前

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

2535
5 个月前

A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.

Python
222
1 年前

Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.

Jupyter Notebook
92
2 天前

Локальное и бесплатное распознавание речи с помощью OpenAI Whisper. Автоматизируйте расшифровку лекций и совещаний на вашем ПК без облачных сервисов и подписок

Python
36
12 天前

Meeting assistant that records, transcribes, and summarizes online meetings with AI. Python backend, Next.js frontend, real-time dashboard.

TypeScript
18
22 天前

WhisperX-powered voice transcription tool that types text directly at your cursor position. Hold F9 to record, release to transcribe.

Python
15
24 天前

a cross-platform and customizable vlc video player that can generate subtitles using WhisperX model

Python
14
2 年前

you feed in a video; it outputs context contained clips resized to 9:16, keeping speaker in center

Python
8
2 个月前

This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (csv, txt, json, & vtt).

Jupyter Notebook
7
1 个月前

ASR (Automatic Speech Recognition) Notebooks

Jupyter Notebook
7
2 年前

User friendly toolkit for generating immersion language learning tools including downloading media, generating subtitles and creating Anki decks

Python
7
18 天前

Generate fully aligned subtitles for any Video or Audio file on your local system for free using the amazing capabilities of WhisperX.

Python
6
6 个月前

A self-hostable platform on which users can create transcripts of their audio files (speech-to-text) using Whisper AI

Python
5
4 天前