A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Updated Jan 24, 2025 - Python
Automagically synchronize subtitles with video.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
A Python package to build AI-powered real-time audio applications
Python AI assistant 🧠
CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
An audio/acoustic activity detection and audio segmentation tool
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Voice Activity Detection based on Deep Learning & TensorFlow
Automatic transcription tool based on Whisper
On-device voice activity detection (VAD) powered by deep learning
PyTorch implementation of Self-Attentive VAD, ICASSP 2021
Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper
Python library for rVAD, a fast, unsupervised method for robust voice activity detection, as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".
The codebase for Data-driven general-purpose voice activity detection.
A Python library for voice activity detection (VAD) for speech/non-speech segmentation.
♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
A collection of basic Python modules for spoken natural language processing