transcription

spotify/basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python
3856
3 months ago
abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Python
3608
4 days ago

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Svelte
2185
3 months ago
Rust
1839
21 hours ago
bugbakery/audapolis

An editor for spoken-word audio with automatic transcription

TypeScript
1736
2 years ago
sindresorhus/awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

1627
1 day ago

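"硬地骇客: The Road to $12,000 ARR in Two Months" is written by the 硬地骇客 team. The book is a faithful record of the Podwise product journey, in five chapters: Inspiration, Build, Launch, Growth, and Retrospective. If reading alone is not enough for you, you are welcome to join the official 硬地骇客 Knowledge Planet (知识星球) community and discuss with the experts. The Podwise story has only just begun, and we will keep sharing what we learn in the community. Success may not be replicable, but failure can always be learned from. Click the link below to join now!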

MDX
1273
22 days ago
juanmc2005/diart

A Python package to build AI-powered real-time audio applications

Python
1253
2 months ago

Simple GUI for ByteDance's Piano Transcription with Pedals

Nix
1247
20 days ago

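Generate subtitles from video and audio and produce SRT files. No third-party API application is required; audio-to-text conversion runs locally. A Transformer-based video subtitle generation framework. A GUI tool for generating subtitles from videos and generating SRT files.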

Python
986
1 year ago

Turnkey self-hosted offline transcription and diarization service with LLM summary

Python
836
7 months ago

Generate subtitles, summaries, and chapters from videos in seconds

PHP
829
3 months ago

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

Swift
826
7 months ago

Generate transcripts for audio and video content with a user-friendly UI, powered by OpenAI's Whisper, with automatic translations and automatic video downloads through yt-dlp integration

JavaScript
790
2 years ago

Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification)

Python
749
9 days ago

A command-line application to convert images, PDFs, and audio files to text using Apple's APIs

Swift
741
2 years ago