Repository navigation

#

transcription

Zackriya-Solutions/meeting-minutes

A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on adding linux support soon) https://meetily.zackriya.com/ is meetly ai

C++
7647
1 天前
abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Python
4840
4 小时前
spotify/basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python
4284
1 个月前

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Svelte
2683
2 个月前
Rust
2026
7 小时前
sindresorhus/awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

1882
19 天前
bugbakery/audapolis

an editor for spoken-word audio with automatic transcription

TypeScript
1770
2 年前
juanmc2005/diart

A python package to build AI-powered real-time audio applications

Python
1470
8 个月前

Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)

Python
1418
1 天前

Simple GUI for ByteDance's Piano Transcription with Pedals

Nix
1359
2 个月前

「硬地骇客 - 两个月 $12000 ARR 实践之路」是由 硬地骇客 团队编著,本书是关于 Podwise 产品历程的忠实记录:内容包含 灵感 - 构建 - 发布 - 增长 - 复盘 五个章节。如果你觉得一个人读不够过瘾,欢迎加入「硬地骇客」官方知识星球与专家们一起讨论!Podwise 的故事才刚刚开始,我们也将在星球持续分享我们的认知,成功可能无法复制,但失败一定可以借鉴。现在就点击下方链接加入吧!

MDX
1320
6 个月前

Self-hosted AI audio transcription

Go
1318
19 天前

视频音频生成字幕,生成srt文件。无需申请第三方API,本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.

Python
1065
2 年前

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

Swift
923
7 天前

Generate subtitles, summaries, and chapters from videos in seconds

PHP
847
4 个月前

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python
828
4 个月前