Repository navigation

#

speech-translation

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python
15436
13 小时前

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Python
12173
6 天前
Python
4146
4 个月前

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python
1391
1 年前

A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.

Python
603
2 年前

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

586
1 年前

Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.

TypeScript
389
3 小时前

MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition.

Python
217
7 个月前

Zero -- A neural machine translation system

Python
153
2 年前

The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"

Python
66
5 天前

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

Python
64
3 年前

Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".

Python
63
1 年前

Repository containing the open source code of works published at the FBK MT unit.

Python
48
1 个月前

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

Python
38
3 年前

List of direct speech-to-speech translation papers.

37
3 年前

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

Python
36
2 年前