Repository navigation

audio-to-text

Website
Wikipedia

pluja / whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

人工智能 audio-to-text Go subtitles sveltekit transcription Whisper ui Web app speech-recognition speech-to-text stt Web

Svelte

2588

149

4 天前

SakiRinn / LiveCaptions-Translator

Lightweight and powerful real-time audio/speech translation tool based on Windows LiveCaptions.

livecaptions Windows speech-to-text audio-to-text API api-integration translation real-time

1144

13 天前

Saik0s / Whisperboard

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

openai iOS speech-recognition speech-to-text SwiftUI transcription audio-to-text tca Whisper whisper-cpp

Swift

898

1 年前

URUWorks / TeroSubtitler

Tero Subtitler is an open source, cross-platform, and free subtitle editing software.

editor Linux macOS subtitles Windows free captions Open Source transcription audio-to-text FFmpeg mpv Whisper yt-dlp 人工智能 blu-ray

Pascal

366

1 天前

Kabanosk / whisper-website

Simple web application, which can be used to convert audio to subtitles by OpenAI's Whisper model

FastAPI openai speech-to-text Whisper Python uvicorn Website audio-to-text subtitles subtitles-generator Open Source Hacktoberfest

Python

311

6 个月前

HenestrosaDev / audiotext

A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.

Python speech-recognition audio-to-text speech-to-text subtitles-generator whisperx FFmpeg

Python

213

10 个月前

javedali99 / audio-to-text-transcription

This repository contains a Python script that allows users to download the audio from a YouTube video, transcribe it into text, detect the language and save the transcription in txt file automatically.

audio-to-text Open Source openai Python transcription Whisper audio YouTube

Python

148

4 个月前

bai0012 / Whisper_auto2lrc

Use Whisper to convert audio files into LRC subtitle files in bulk. 使用whisper实现将音频文件批量转换为lrc字幕文件

audio-to-text Python Whisper Windows PyTorch

Python

4 个月前

rudymohammadbali / Whisper-Transcriber

Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.

GUI openai Whisper audio-to-text stt

Python

1 年前

persiandataset / PersianSpeech

Persian ASR dataset

dataset asr persian-speech-recognition audio-to-text

2 年前

xndien2004 / Speech-to-text-Realtime-with-extension

"Speech-to-Text Realtime with Extension" is a browser extension that converts speech to text in real-time. It supports multiple languages, making it ideal for note-taking, customer service, and accessibility. Easy to install and use on popular browsers.

audio-to-text Django Google 云 openai-api realtime

Jupyter Notebook

9 个月前

KostasEreksonas / Audio-transcriber

Simple Python audio transcriber using OpenAI's Whisper speech recognition model

audio openai openai-whisper text transcription Whisper audio-to-text Python pip YouTube youtube-dl

Python

5 个月前

Education-Victory / whisper-webui

WebUI for Whisper API

audio-to-text transcription

Python

1 年前

thinh-vu / ur_audio_sub

Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.

audio-to-text speech-recognition Whisper

Jupyter Notebook

2 年前

inferless / whisper-large-v3

State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. gpu: T4 | collections: ["CTranslate2"]

audio-to-text

Python

4 个月前

GabrieleRisso / aiyu

core shell functions building blocks for advanced AI pipelines

人工智能 audio-to-text gpt-3 stable-diffusion text-to-audio text-to-image text-to-speech tts Whisper

2 年前

markydoodled / Journal.it

A SwiftUI App For People Who Need To Take Down Important Information Quickly.

SwiftUI texteditor camera iOS macOS audio-editing photo-editing audio-to-text Swift

Swift

2 年前

gisty-org / chrome-extension

Chrome Extension to capture captions of ongoing meetings by using webkitspeechrecognition api for all the web video conferencing platforms (for google meet, it directly extracts the captions) and sends to flask api for summarization.

JavaScript audio-to-text Google Meet Chrome 插件

JavaScript

2 年前

AzizBenAli / YouTube-AI-Assistant

Develop a python application that allows you to extract valuable insights, engage in meaningful conversations, and explore video content in a whole new way.

agents 聊天机器人 retrieval-augmented-generation Streamlit youtube-api audio-to-text embeddings openai pineconedb conversational-agents conversational-bots memory generative-ai

Python

2 年前

gabrielsenadev / audioinsight

AudioInsight is a web application that processes audio, generates transcriptions, and allows users to ask questions about the related audio.

audio-processing audio-to-text cloudflare-ai full-stack webdev Whisper

TypeScript

1 年前