Repository navigation

#

audio-to-text

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Svelte
2588
4 天前

Lightweight and powerful real-time audio/speech translation tool based on Windows LiveCaptions.

C#
1144
13 天前

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

Swift
898
1 年前

Tero Subtitler is an open source, cross-platform, and free subtitle editing software.

Pascal
366
1 天前

Simple web application, which can be used to convert audio to subtitles by OpenAI's Whisper model

Python
311
6 个月前

A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.

Python
213
10 个月前

This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically.

Python
148
4 个月前

Use Whisper to convert audio files into LRC subtitle files in bulk. 使用whisper实现将音频文件批量转换为lrc字幕文件

Python
65
4 个月前

Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.

Python
56
1 年前

"Speech-to-Text Realtime with Extension" is a browser extension that converts speech to text in real-time. It supports multiple languages, making it ideal for note-taking, customer service, and accessibility. Easy to install and use on popular browsers.

Jupyter Notebook
37
9 个月前

Simple Python audio transcriber using OpenAI's Whisper speech recognition model

Python
34
5 个月前

WebUI for Whisper API

Python
30
1 年前

Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.

Jupyter Notebook
16
2 年前

State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. gpu: T4 | collections: ["CTranslate2"]

Python
15
4 个月前

A SwiftUI App For People Who Need To Take Down Important Information Quickly.

Swift
13
2 年前

Chrome Extension to capture captions of ongoing meetings by using webkitspeechrecognition api for all the web video conferencing platforms (for google meet, it directly extracts the captions) and sends to flask api for summarization.

JavaScript
11
2 年前

Develop a python application that allows you to extract valuable insights, engage in meaningful conversations, and explore video content in a whole new way.

Python
11
2 年前

AudioInsight is a web application that processes audio, generates transcriptions, and allows users to ask questions about the related audio.

TypeScript
8
1 年前