speech-separation

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python
3232
6 days ago
Python
2440
1 month ago

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Python
1153
1 year ago

A must-read paper for speech separation based on neural networks

TypeScript
801
8 days ago

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Python
724
2 years ago

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python
574
2 years ago

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

MATLAB
515
4 years ago

A PyTorch implementation of "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

Python
497
2 years ago
Python
468
1 year ago

This repo summarizes tutorials, datasets, papers, code, and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.

MATLAB
466
5 years ago

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch

Python
446
3 years ago

Tools for Speech Enhancement integrated with Kaldi

Python
419
2 years ago

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems for speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many other tasks.

HTML
370
2 months ago

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

Jupyter Notebook
331
2 years ago