Repository navigation

#

Whisper

Created by OpenAI

发布于 August 2021

openai/whisper
openai.com

相关主题

机器学习人工智能

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

ggml-org/whisper.cpp
C++
39345
2 天前

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python
15043
7 天前

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Python
14245
11 天前

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Python
11794
3 天前

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python
9862
5 天前

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Python
7549
3 小时前

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook
4585
1 年前
Swift
4519
4 天前

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.

Python
4503
1 个月前

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python
4461
21 天前

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook
4400
1 个月前

Mac app for crushing tech interviews with AI

Swift
4196
3 个月前

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python
3829
3 个月前
embarklabs/embark

Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

JavaScript
3796
9 个月前

A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on adding linux support soon) https://meetily.zackriya.com/

C++
3733
几秒前
abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Python
3606
4 天前
Grt1228/chatgpt-java

ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java

Java
3435
8 个月前

🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python

Python
3253
4 个月前