
whisperx


Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

faster-whisper tts Whisper gradio subtitles transcription translator webui speech-recognition speech-synthesis speech-to-text text-to-speech yt-dlp voice-cloning podcasts audiobook voice-conversion karaoke whisperx

Python

4840

414

2 months ago

CheshireCC / faster-whisper-GUI

faster_whisper GUI with PySide6

faster-whisper openai transcribe vad Whisper whisperx asr

Python

2689

155

10 months ago

Purfview / whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

openai speech-to-text Whisper asr speech-recognition subtitles ctranslate2 faster-whisper whisperx uvr diarization speaker-diarization

2535

131

5 months ago

transcriptionstream / transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

Automation diarization Large language models speaker-diarization speech-recognition transcription Whisper ollama mistral-7b whisperx

Python

889

1 year ago

Pikurrot / whisper-gui

A simple GUI to use Whisper.

gradio GUI Whisper whisper-ai whisperx huggingface transformers interface speech-recognition speech-to-text

Python

230

3 months ago

HenestrosaDev / audiotext

A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.

Python speech-recognition audio-to-text speech-to-text subtitles-generator whisperx FFmpeg

Python

222

1 year ago

kurianbenoy / Indic-Subtitler

Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.

asr FastAPI Next Web app Deep learning faster-whisper inference openai quantization speech-recognition speech-to-text transformers Whisper whisperx

Jupyter Notebook

2 days ago

empenoso / offline-audio-transcriber

Local, free speech recognition with OpenAI Whisper. Automate the transcription of lectures and meetings on your PC without cloud services or subscriptions.

Whisper whisperx diarization

Python

12 days ago

pulijon / Sttcast

Transcription from mp3 files to html with or without embedded player

Ansible Automation Infrastructure as code Puppet Python Terraform transcription Vagrant Whisper aws-ec2 aws-s3 gpu diarization whisperx Artificial intelligence openai-api rag pyannote

Jupyter Notebook

2 days ago

ih3xcode / h3xassist

Meeting assistant that records, transcribes, and summarizes online meetings with AI. Python backend, Next.js frontend, real-time dashboard.

Automation browser-automation FastAPI Google Meet Linux meeting-assistant microsoft-teams Next Playwright Python real-time speaker-diarization speech-recognition transcription WebSocket whisperx

TypeScript

22 days ago

Aaronontheweb / witticism

WhisperX-powered voice transcription tool that types text directly at your cursor position. Hold F9 to record, release to transcribe.

PyTorch transcription voice-commands Whisper whisperx

Python

24 days ago

lurub / RViewer

A cross-platform, customizable VLC video player that can generate subtitles using the WhisperX model

pyside6 Whisper pyside whisperx

Python

2 years ago

superyhee / whisperx-on-aws-jumpstart

Deploy Whisper on AWS

Amazon Web Services cloudformation ec2 Whisper whisperx

Python

1 year ago

emmanuelinfante / SubtitlesEveryone

Transcribe Like a Pro, Without Paying a Penny!

colab colab-notebook colaboratory Deep learning deepl extract srt srt-subtitles subtitles subtitles-generator vtt baidu-api Whisper whisperx vad translator-app translators Google

Jupyter Notebook

7 months ago

austinwmille / orca

You feed in a video; it outputs context-contained clips resized to 9:16, keeping the speaker in the center

diarization nltk pyannote whisperx Large language models huggingface

Python

2 months ago

mrhallonline / WhisperXTranscription4Researchers

This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (csv, txt, json, & vtt).

diarization Machine learning Natural language processing transcription whisper-ai whisperx asr speech-to-text

Jupyter Notebook

1 month ago