Repository navigation

#

timit

mravanelli/pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Python
2385
3 年前

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

HTML
365
4 个月前

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.

303
3 年前

CTC end -to-end ASR for timit and 863 corpus.

Python
218
5 年前

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Python
139
4 年前

Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API of Tensorflow, which makes the training and evaluation truly end-to-end.

Python
89
6 年前

Speech recognition on the TIMIT (or any other) dataset

Python
42
7 年前

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

Perl
38
7 年前

Pytorch based phoneme recognition (TIMIT phoneme classification)

Python
34
7 年前

THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

Perl
33
7 年前

A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.

Python
19
7 年前

Attention-based end-to-end ASR on TIMIT in PyTorch

Python
17
3 年前

Extract mfcc vectors and phones from TIMIT dataset

Shell
16
2 年前

Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.

Python
11
5 年前

Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.

Python
11
5 年前

Python/numpy/pandas convenience wrapper for the TIMIT database.

Jupyter Notebook
11
6 年前