# asr

Here are 1,098 public repositories matching this topic...

robmsmt / SpeechLoop

Many ASRs under one roof, with benchmarking, answering the question: what is the best ASR for my dataset?

python speech speech-recognition speech-to-text asr speech-analysis asr-benchmark speechrecognition speech-api asr-model

Updated Oct 5, 2022
Python

BScUniversityCollaborations / automatic-speech-recognition

Created an ASR (Automatic Speech Recognition) system that takes in individual recordings. Each recording represents a sentence composed of 5-10 English language digits, separated by adequate pauses. The system involves segmenting the sentence using a classifier, differentiating between background and foreground sounds.

python classifier automatic-speech-recognition asr openslr mel-spectrogram recognition-algorithms

Updated Sep 12, 2023
Python

lizunowa / project-asr-metrics

🧑🏻‍🎓 📑 October '20 - April '21. Group university project on a Speech-to-Text Assessment Tool. It is a research-type project; most of the documentation is in a private GitLab repository.

asr asr-benchmark

Updated Jun 8, 2021
Jupyter Notebook

HeyHera / Hera

This project presents Hera, an operating-system-level voice recognition package that understands voice commands and performs actions to simplify the user’s workflow. We propose a modern way of interacting with Linux systems, where the latency of conventional physical inputs is minimized through the use of natural language speech recognition.

python scikit-learn nlu spacy kivy tts asr wake-word-detection sgd-classifier vosk nix-tts

Updated Jul 12, 2022
Python

maximkm / DLA_ASR_HW

ASR pytorch project

transformers pytorch lm beam-search asr asr-model bpe

Updated Oct 16, 2022
Python

jevil25 / Lip-Read-ML-Model

This is a machine learning project. The model takes a video of a person's face as input and predicts the word. It uses TensorFlow and Keras Sequential models for training and prediction, with ReLU and softmax as activation functions.

machine-learning tensorflow asr

Updated Aug 8, 2023
Jupyter Notebook

kingabzpro / hindiSpeechPro-Automatic-Speech-Recognization

This project, the final project for the KaggleX BIPOC Mentorship Program, aims to train two separate Hindi ASR models using the Facebook Wav2Vec2 (300M parameters) and OpenAI Whisper-Small models, respectively. The goal is to compare their performance, with a target WER of less than 13%, across various Hindi accents and dialects.

transformer speech-recognition whisper asr hindi-language wav2vec2

Updated Nov 18, 2023
Jupyter Notebook

Forced-Alignment-and-Vowel-Extraction / fave-asr

Interface for automated transcription and time alignment of conversational interview data

linguistics asr sociolinguistics

Updated Apr 22, 2024
Python

alekseevskaia / audio_attack

asr adversarial-attacks carlini-wagner

Updated Jan 29, 2024
Jupyter Notebook

marks038 / Test

Test Repo

test test1 calculators asr

Updated Feb 23, 2024

ZYancey / ASR-Jukebox

A Spotify remote that operates using an ML-powered automatic speech recognition and intent detection pipeline

spacy asr spotipy

Updated Nov 5, 2023
Python

Rumeysakeskin / Speech-Datasets-for-ASR

Download speech datasets (English and non-English) for Automatic Speech Recognition

speech-synthesis speech-recognition speech-to-text speech-processing asr speech-dataset audio-datasets voice-datasets common-voice-dataset voxforge-dataset

Updated Jan 22, 2023
Jupyter Notebook

Nexdata-AI / 194999-Uyghur-Pronunciation-Dictionary

194999-Uyghur-Pronunciation-Dictionary

speech-recognition pronunciation-dictionary asr uyghur

Updated Aug 8, 2024

Nexdata-AI / 1044-Hours-Minnan-Dialect-Speech-Data-by-Mobile-Phone

1044-Hours-Minnan-Dialect-Speech-Data-by-Mobile-Phone

speech-recognition speech-to-text minnan asr

Updated Aug 8, 2024

Nexdata-AI / 201-Hours-North-American-English-Speech-Data-by-Mobile-Phone-and-PC

North American English Speech Dataset

audio deep-learning speech tts speech-synthesis dataset speech-recognition automatic-speech-recognition speech-to-text asr asr-benchmark

Updated Aug 8, 2024

Nexdata-AI / 592-People-Number-Speech-Data-in-Mandarin-and-Dialects-by-Mobile-Phone

Number Speech Dataset in Mandarin and Dialects

audio deep-neural-networks deep-learning speech speech-synthesis dataset wav speech-recognition speech-to-text asr asr-model

Updated Aug 8, 2024

Nexdata-AI / 87166-Minnan-Dialect-Pronunciation-Dictionary

Dialect-Pronunciation-Dictionary

text lexicon speech-to-text pronunciation-dictionary asr

Updated Aug 8, 2024

Nexdata-AI / 197-Hours-Korean-Speech-Data-by-Mobile-Phone_Reading

Korean Speech Dataset

audio deep-learning speech dataset speech-recognition speech-to-text asr

Updated Aug 8, 2024

Nexdata-AI / 1030-Hours-Shanghai-Dialect-Speech-Data-by-Mobile-Phone

Shanghai Dialect Speech Dataset

audio deep-learning speech tts speech-recognition automatic-speech-recognition speech-to-text asr

Updated Aug 8, 2024

Nexdata-AI / 200-People-Chinese-Wake-up-Words-Speech-Data-by-Mobile-Phone

Chinese Wake-up Words Speech Dataset

audio deep-learning speech dataset speech-recognition speech-to-text asr asr-model

Updated Aug 8, 2024
