Repository navigation

speech-separation

Website
Wikipedia

A PyTorch-based Speech Toolkit

speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition asr voice-recognition speaker-diarization speaker-verification PyTorch huggingface transformers language-model 深度学习

Python

10296

1542

7 天前

espnet / espnet

End-to-End Speech Processing Toolkit

深度学习 end-to-end chainer PyTorch kaldi speech-recognition speech-synthesis speech-translation machine-translation voice-conversion speech-enhancement speech-separation singing-voice-synthesis speaker-diarization text-to-speech

Python

9386

2315

2 天前

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

audio 深度学习 noise-suppression PyTorch speech speech-enhancement speech-separation

Python

3232

263

6 天前

asteroid-team / asteroid

The PyTorch-based audio source separation toolkit for researchers

source-separation speech-separation audio-separation speech-enhancement 深度学习 PyTorch pretrained-models

Python

2440

437

1 个月前

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

tts stt speech-to-text text-to-speech speech-recognition speech-synthesis speech-processing voice-recognition voice-activity-detection voice-cloning speech-separation

1352

148

1 年前

maum-ai / voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

source-separation audio-separation speech-separation PyTorch voicefilter

Python

1153

233

1 年前

JusperLee / Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

Bukkit speech-enhancement speech-separation

TypeScript

801

136

8 天前

kaituoxu / Conv-TasNet

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

speech-separation source-separation audio-separation PyTorch

Python

724

156

2 年前

Audio-WestlakeU / FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

speech-enhancement speech-processing speech-separation PyTorch pretrained-model Bukkit noise-reduction denoising audio reproducible-research speech

Python

574

157

2 年前

anicolson / DeepXi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

resnet Tensorflow speech-enhancement residual-networks speech-separation source-separation Keras attention

MATLAB

515

125

4 年前

JusperLee / Conv-TasNet

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement

PyTorch speech-separation 深度学习

Python

497

2 年前

microsoft / UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

PyTorch speech-recognition speech-processing speech diarization speech-separation speaker-verification

Python

468

1 年前

gemengtju / Tutorial_Separation

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

speech-separation speech-processing speech-analysis 深度学习深度神经网络 signal-processing

MATLAB

466

5 年前

JusperLee / Dual-Path-RNN-Pytorch

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch

PyTorch 深度学习 rnn-model speech-separation

Python

446

3 年前

double22a / speech_dataset

The dataset of Speech Recognition

asr speech-recognition 深度学习 dataset audio 深度神经网络 wav speech-to-text speech tts speech-synthesis voice-conversion speech-translation speech-enhancement speech-separation text-to-speech automatic-speech-recognition

420

8 个月前

funcwj / setk

Tools for Speech Enhancement integrated with Kaldi

kaldi speech-enhancement speech speech-separation

Python

419

2 年前

speechbrain / speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

深度学习 speech-recognition speech-to-text speech speech-processing speaker-recognition speaker-verification speaker-identification speech-separation speechrecognition 神经网络 neural-networks timit speech-analysis

HTML

370

2 个月前

posenhuang / deeplearningsourceseparation

Deep Recurrent Neural Networks for Source Separation

speech-separation MATLAB 深度学习 audio-separation source-separation speech-denoising rnn

MATLAB

369

133

4 年前

etzinis / sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

深度学习 speech speech-separation audio

Jupyter Notebook

331

2 年前

seanwood / gcc-nmf

Real-time GCC-NMF Blind Speech Separation and Enhancement

speech-separation speech-enhancement nmf real-time real-time-processing speech speech-processing low-latency 机器学习 gcc Jupyter Notebook speaker

Python

324

134

6 年前