Many ASRs under one roof, with benchmarking to answer the question: what is the best ASR for my dataset?
Updated Oct 5, 2022 - Python
An ASR (Automatic Speech Recognition) system that takes individual recordings as input. Each recording is a sentence of 5-10 spoken English digits separated by adequate pauses. The system segments the sentence with a classifier that distinguishes foreground from background sounds.
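The segmentation step described above can be illustrated with a minimal sketch. The repository's actual classifier is not shown here, so this example swaps in a simple per-frame energy threshold as a stand-in for the foreground/background decision. The function name `segment_by_energy` and the parameters `frame_len`, `threshold`, and `min_gap_frames` are hypothetical.

```python
from typing import List, Sequence, Tuple


def segment_by_energy(
    samples: Sequence[float],
    frame_len: int = 160,
    threshold: float = 0.01,
    min_gap_frames: int = 3,
) -> List[Tuple[int, int]]:
    """Return (start, end) sample ranges judged to be foreground sound.

    Assumption: a frame is "foreground" when its mean squared amplitude
    exceeds `threshold`. A trained classifier would replace this test.
    """
    n_frames = len(samples) // frame_len

    # Label every frame as foreground (True) or background (False).
    is_foreground = []
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        energy = sum(x * x for x in frame) / frame_len
        is_foreground.append(energy > threshold)

    segments = []
    start = None  # first frame of the segment currently being built
    gap = 0       # consecutive background frames seen inside that segment
    for i, fg in enumerate(is_foreground):
        if fg:
            if start is None:
                start = i
            gap = 0
        elif start is not None:
            gap += 1
            # A long enough pause closes the segment at the last foreground frame.
            if gap >= min_gap_frames:
                end_frame = i - gap + 1
                segments.append((start * frame_len, end_frame * frame_len))
                start = None
                gap = 0

    # Close a segment that runs to the end of the recording.
    if start is not None:
        end_frame = n_frames - gap
        segments.append((start * frame_len, end_frame * frame_len))
    return segments
```

Each returned range could then be passed to a digit recognizer on its own. The `min_gap_frames` setting relies on the pauses between digits being longer than any dip in energy within a digit.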
🧑🏻🎓 📑 October 2020 - April 2021. Group university project. The project topic is a Speech-to-Text Assessment Tool. It is a research-type project; most of the documentation is in a private GitLab repository.
This project presents Hera, an operating-system-level voice recognition package that understands voice commands and performs actions to simplify the user's workflow. We propose a modern way of interacting with Linux systems, where the latency of conventional physical inputs is minimized through the use of natural language speech recognition.
ASR PyTorch project
This is a machine learning project. The model takes video of a person's face as input and predicts the spoken word. It uses TensorFlow and Keras Sequential models for training and prediction, with ReLU and softmax as activation functions.
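For readers unfamiliar with the two activation functions named above, here is a minimal plain-Python sketch of what they compute. It is not the repository's Keras code; the function names and the example scores are illustrative assumptions.

```python
import math
from typing import List, Sequence


def relu(values: Sequence[float]) -> List[float]:
    """Rectified linear unit: negative inputs become zero."""
    return [max(0.0, v) for v in values]


def softmax(values: Sequence[float]) -> List[float]:
    """Turn raw scores into probabilities that sum to 1."""
    # Subtracting the maximum keeps exp() from overflowing on large scores.
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical scores for three candidate words from a final layer.
scores = [2.0, -1.0, 0.5]
probabilities = softmax(relu(scores))
predicted_index = probabilities.index(max(probabilities))
```

In a word classifier of this kind, ReLU is typically applied in hidden layers and softmax in the output layer, so that the predicted word is the class with the highest probability.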
This project, the final project for the KaggleX BIPOC Mentorship Program, aims to train two separate Hindi ASR models, one based on Facebook Wav2Vec2 (300M parameters) and one on OpenAI Whisper-Small. The goal is to compare their performance across various Hindi accents and dialects, with a target WER of less than 13%.
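WER (word error rate) is the number of word substitutions, deletions, and insertions needed to turn the model's transcript into the reference, divided by the number of reference words. The sketch below computes it with a word-level edit distance. It assumes whitespace tokenization and no text normalization, which a real evaluation may handle differently, and the function name `word_error_rate` is illustrative.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref_words)


# One substitution and one deletion against a six-word reference.
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
meets_target = wer < 0.13
```

Under these assumptions, a model meets the stated target when its WER over the evaluation set is below 0.13. Corpus-level WER is usually computed by summing errors and reference words over all utterances rather than averaging per-utterance rates.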
Interface for automated transcription and time alignment of conversational interview data
Download speech datasets (English and non-English) for Automatic Speech Recognition
194999-Uyghur-Pronunciation-Dictionary
1044-Hours-Minnan-Dialect-Speech-Data-by-Mobile-Phone
North American English Speech Dataset
Number Speech Dataset in Mandarin and Dialects
Dialect-Pronunciation-Dictionary
Korean Speech Dataset
Shanghai Dialect Speech Dataset
Chinese Wake-up Words Speech Dataset