transcription

Zackriya-Solutions/meeting-minutes

A free and open-source, self-hosted, AI-based live meeting note taker and minutes summary generator that runs entirely on your local device (macOS and Windows are supported; Linux support is planned). Meetily AI: https://meetily.zackriya.com/

C++
7079
8 days ago
abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Python
4394
1 month ago
spotify/basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python
4167
6 days ago

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Svelte
2588
5 days ago
Rust
1987
10 hours ago
sindresorhus/awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

1815
7 days ago
bugbakery/audapolis

an editor for spoken-word audio with automatic transcription

TypeScript
1759
2 years ago
juanmc2005/diart

A Python package to build AI-powered real-time audio applications

Python
1406
6 months ago

Simple GUI for ByteDance's Piano Transcription with Pedals

Nix
1317
9 hours ago

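"硬地骇客: The Road to $12,000 ARR in Two Months" is written by the 硬地骇客 team. The book is a faithful record of the Podwise product journey, organized into five chapters: Inspiration, Build, Launch, Growth, and Retrospective. If reading on your own is not enough, you are welcome to join the official 硬地骇客 community on 知识星球 (Knowledge Planet) and discuss it with experts. The Podwise story has only just begun, and we will keep sharing what we learn there. Success may not be reproducible, but failures can always be learned from. Click the link below to join now!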

MDX
1311
5 months ago

A GUI tool that generates subtitles from video and audio and writes them as SRT files. No third-party API is required; audio-to-text conversion runs locally. Built on a Transformer-based video subtitle generation framework.

Python
1046
2 years ago

Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification)

Python
955
18 hours ago

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

Swift
898
1 year ago

Generate subtitles, summaries, and chapters from videos in seconds

PHP
844
3 months ago

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python
800
3 months ago