Repository navigation

#

silero-vad

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

C
397
1 个月前

A sophisticated real-time voice assistant that seamlessly integrates speech recognition, AI reasoning, and neural text-to-speech synthesis. It is designed for natural conversational interactions with advanced tool-calling capabilities.

Python
43
2 个月前

A real-time Voice Activity Detection (VAD) library for iOS and macOS using Silero models powered by ONNX Runtime. Includes advanced noise suppression and audio preprocessing with WebRTC APM, supporting seamless WAV data output with header metadata.

Swift
34
4 个月前

In this repository, I show you how to use SILERO VAD with ONNX-WEB runtime to run the VAD compeletely in the browser.

JavaScript
24
8 个月前
Swift
22
9 个月前

VAD is a cross-platform Dart binding for the VAD JavaScript library. This package provides access to a Voice Activity Detection (VAD) system, allowing Flutter applications to start and stop VAD-based listening and handle various VAD events.

C++
19
1 个月前

Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech

C++
16
1 年前

Audio transcription using mlx whisper and vad silence processing

Python
15
10 个月前

Enterprise VAD (Voice Activity Detection) in C#.NET (.NET 6.0+) with Microsoft.ML.Net, ONNXRuntime and DirectML. The easiest, efficient, and performant Silero VAD implementation! Always open for PRs.

C#
13
4 个月前

Python script for detect silences with Silero-VAD and transcribing with the whisper AI model.

Python
12
4 个月前

This repo provides an addon that can perform VAD model reasoning in nodes and electric environments, based on cmake-js and Fastdeploy. Silero VAD is a pre-trained enterprise-grade Voice Activity Detector.

C++
11
2 年前

C++ implementation of real-time Voice Activity Detection (VAD) using Silero models with ONNX Runtime and WebRTC Audio Processing. Provides precise voice segmentation and cross-platform XCFramework support.

C++
10
4 个月前

Test comparison of two VAD models with English and multilingual speech datasets

Python
8
1 个月前

Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client to server for live transcription and optional translation. Supports CLI and Python API.

Python
7
8 天前

A demo project to test silero-vad using REST API

Python
6
3 年前