ASR-hybrid-decoding

An updated version of this toolkit now lives on our lab's github

ASR-hybrid-decoding

This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs. The output is a mix of in-vocabulary words and phoneme sequences. This decoding is suitable for systems with only a small dictionary available and for further recovery of OOV words.

Theory:

Brief description of the hybrid decoding system can be found in a paper and generally follows an approach in an earlier paper

Requirements:

For this to work you'll need kaldi speech recognition toolkit installed

This expansion of kaldi was been tested on the following databases:

How to run:

First run kaldi recipies and then on top of them you can run hybrid decoding as presented here. The file structure in this repository is the same as kaldi file structure, so it suffices to copy scripts from this repository to corresponding folders in your kaldi system build. After that, run run_hybrid_decoding.sh script, which will build the hybrid decoding graph and perform the decoding.

LibriSpeech setup:

OOV_list_1000.txt has a selection of 1000 words to perform as OOVs for this database. For a detailed description of how they were chosen, see subsection 3.1 in paper

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
LibriSpeech/s5		LibriSpeech/s5
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ASR-hybrid-decoding

Theory:

Requirements:

How to run:

LibriSpeech setup:

About

Releases

Packages

Languages

License

kate-egorova/ASR-hybrid-decoding

Folders and files

Latest commit

History

Repository files navigation

ASR-hybrid-decoding

Theory:

Requirements:

How to run:

LibriSpeech setup:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages