Repository navigation
faster-whisper
- Website
- Wikipedia
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
faster_whisper GUI with PySide6
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。
Real-time transcription using faster-whisper
⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,为媒体内容数据自动化处理提供了强大且可扩展的解决方案。
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names.
The subtitles and translations are generated in real-time and displayed as pop-ups.
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.
userscripts for mpv
Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.
faster-whisper livestream translation, OBS noise reduction, dual language subtitles
A wayland overlay providing speech-to-text functionality for any application via a global push-to-talk hotkey
Real-time Speech To Text using Faster Whisper.