Repository navigation

#

whisperx

abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Python
3608
4 天前
Python
2314
4 个月前

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

1958
8 小时前

turnkey self-hosted offline transcription and diarization service with llm summary

Python
836
7 个月前

A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.

Python
196
6 个月前

Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.

Jupyter Notebook
88
8 天前

a cross-platform and customizable vlc video player that can generate subtitles using WhisperX model

Python
12
2 年前

ASR (Automatic Speech Recognition) Notebooks

Jupyter Notebook
6
2 年前

User friendly toolkit for generating immersion language learning tools including downloading media, generating subtitles and creating Anki decks

Python
6
5 小时前

A sleek, web-based audio player featuring synchronized subtitle display, speaker diarization support, and keyboard controls in a modern, responsive interface

JavaScript
5
3 个月前

This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (csv, txt, json, & vtt).

Jupyter Notebook
4
15 天前

AI 驱动的视频译配工具. An AI powered tool to execute end-to-end video dubbing.

Python
4
18 天前

Generate fully aligned subtitles for any Video or Audio file on your local system for free using the amazing capabilities of WhisperX.

Python
4
2 天前

Code for our INTERSPEECH 2024 paper: Comparing ASR Systems in the Context of Speech Disfluencies.

Jupyter Notebook
3
1 年前

A tool for automatically adding subtitles to short social media videos

Python
2
9 个月前

VideoWise is a video transcription and AI-powered analysis tool that helps users easily upload, transcribe, and interact with video content. Using WhisperX for high-quality transcriptions and Ollama for AI-driven insights, VideoWise makes it easy to search, analyze, and export video data.

HTML
2
5 个月前