Repository navigation

#

voice-detection

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

MATLAB
864
4 年前

An audio/acoustic activity detection and audio segmentation tool

Python
818
10 个月前

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

C
414
3 个月前

Gecko - A Tool for Effective Annotation of Human Conversations

JavaScript
298
3 年前

A statistical model-based Voice Activity Detection

Jupyter Notebook
194
7 年前

Efficient voice activity detection algorithm using long-term speech information

MATLAB
46
8 年前

Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.

Python
36
4 年前

End to end AWS SageMaker application for detecting the AWS Polly voice in an audio recording using Gluon and MXNet.

Jupyter Notebook
6
5 年前

Spoofing voice detection : 2nd YAICON

Python
2
2 年前

The Poetry Pronunciation Learning App is an interactive AI-powered tool that helps users practice and improve their pronunciation of poems. It uses real-time speech recognition, voice activity detection, and fuzzy word matching to provide instant feedback on spoken verses.

Python
2
1 个月前

End-to-end pipeline for training a custom keyword detection model with TensorFlow & TFLite expor

Python
2
2 个月前

this is a p5js experiment that uses voice detection and cursor movement to multiply creative content in a variety of colours

JavaScript
2
7 年前

TranscribeTube is a Python tool that transcribes and generates subtitles for videos from local files or YouTube links using Hugging Face models. It features an interactive Gradio web interface, allowing users to easily upload videos, select languages, and download subtitles in SRT format.

Python
2
1 年前

Voice-Detection

HTML
1
3 年前

Config files for my GitHub profile.

Jupyter Notebook
0
2 年前

using a simple convolution neural network to classify voices based on the existence of wake word

Jupyter Notebook
0
1 年前

A database of challenging voice utterances collected by the Biometrics Vision and Computing (BVC) group.

0
6 个月前