voice-activity-detection

Here are 280 public repositories matching this topic...

modelscope / FunASR

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Updated Jun 11, 2026
Python

noisetorch / NoiseTorch

Real-time microphone noise suppression on Linux.

linux voice pulseaudio hacktoberfest noise-reduction voice-activity-detection voice-activated noise-suppression hacktoberfest2023

Updated Jan 13, 2025
Go

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Jun 12, 2026
Jupyter Notebook

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Mar 26, 2026
Python

smacke / ffsubsync

Automagically synchronize subtitles with video.

Updated Jun 12, 2026
Python

jim-schwoebel / voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

data voice voice-commands dataset voice-recognition noise voice-chat datasets voice-control voice-conversion voice-assistant voice-activity-detection voice-synthesis audio-datasets voice-computing voice-dataset voice-datasets audio-dataset

Updated Jun 6, 2024

FluidInference / FluidAudio

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.

audio macos swift ios real-time avfoundation nvidia vad automatic-speech-recognition speech-to-text ane speaker-recognition asr speaker-diarization voice-activity-detection coreml speaker-identification speaker-embedding parakeet

Updated Jun 14, 2026
Swift

TEN-framework / ten-vad

Voice Activity Detector (VAD): low-latency, high-performance, and lightweight

audio real-time voice-commands speech voice-recognition vad automatic-speech-recognition speech-processing conversational-ai voice-activity-detection voice-agent silero-vad

Updated Feb 2, 2026
C

iamsrikanthnani / pluely

The Open Source Alternative to Cluely - A lightning-fast, privacy-first AI assistant that works seamlessly during meetings, interviews, and conversations without anyone knowing. Built with Tauri for native performance, just 10MB. Completely undetectable in video calls, screen shares, and recordings.

react desktop-app rust typescript gemini openai speech-to-text stealth grok claude voice-activity-detection undetectable tauri tailwindcss ai-assistant llm shadcn cluely-alternative grok-4

Updated Jan 14, 2026
TypeScript

ricky0123 / vad

Voice activity detector (VAD) for the browser with a simple API

typescript web speech-to-text web-audio-api voice-activity-detection onnxruntime silero-vad

Updated Jan 30, 2026
TypeScript

juanmc2005 / diart

A Python package to build AI-powered real-time audio applications

real-time deep-learning transcription speaker-diarization streaming-audio voice-activity-detection speaker-embedding

Updated Feb 12, 2025
Python

BingLingGroup / autosub

Command-line utility to transcribe/translate from video/audio/subtitles to subtitles

subtitles substation-alpha audio-segmentation xfyun cloud-speech-api voice-activity-detection baidu-api xunfei-api

Updated Dec 21, 2023
Python

k2-fsa / sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn, without an Internet connection. Supports iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A, etc.

kotlin python c go csharp cpp speech-recognition vad asr voice-activity-detection

Updated Oct 20, 2025
C++

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

text-to-speech tts speech-synthesis voice-recognition speech-recognition speech-to-text stt speech-processing voice-activity-detection speech-separation speech-emotion-recognition voice-cloning

Updated Jun 6, 2024

ggeop / Python-ai-assistant

Python AI assistant 🧠

Updated Nov 17, 2024
Python

sandrohanea / whisper.net

Whisper.net. Speech to text made simple using Whisper Models

translation cross-platform dotnet dotnetcore speech-recognition vad speech-to-text voice-activity-detection

Updated Jun 14, 2026
C#

ina-foss / inaSpeechSegmenter

CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

Updated Mar 12, 2026
Python

soniqo / speech-swift

AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML

macos swift ios text-to-speech tts speech-recognition asr mlx speaker-diarization voice-activity-detection coreml speech-enhancement on-device neural-engine speech-to-speech apple-silicon

Updated Jun 14, 2026
Swift

jtkim-kaist / VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

data speech dnn lstm speech-recognition attention vad voice-detection voice-activity-detection bdnn acam speech-activity-detection

Updated Jun 9, 2021
MATLAB

amsehili / auditok

An audio/acoustic activity detection and audio segmentation tool

vad audio-data audio-activities audio-segmentation voice-detection voice-activity-detection

Updated May 14, 2026
Python
