autosubs

Autosubs is a library for automatic speech recognition (ASR) built on a modified version of Listen, Attend and Spell (LAS). The main changes are three additional bidirectional LSTM layers at the start of the network, dropout between all LSTM layers, and scaling of the dot-product attention scores. Training on WSJ consisted of pretraining the decoder for about 15 epochs, then training for hundreds of epochs, which took 24 to 48 hours on a single Tesla V100.
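To make the attention change concrete, here is a minimal pure-Python sketch of scaled dot-product attention for a single decoder query. It assumes the conventional scale factor of sqrt(d), where d is the query/key dimension; the exact scaling used in autosubs is not stated here, and the function names are illustrative rather than taken from the repository.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(query, keys, values):
    """Attend over encoder outputs with one decoder query.

    query:  list of d floats (decoder state)
    keys:   list of T lists of d floats (one key per encoder time step)
    values: list of T lists of floats (one value per encoder time step)
    Returns (context, weights).
    """
    # Assumption: scale by sqrt(d), the conventional choice.
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim_v = len(values[0])
    # The context vector is the attention-weighted sum of the values.
    context = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(dim_v)]
    return context, weights


query = [1.0, 0.0]
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
context, weights = scaled_dot_product_attention(query, keys, values)
```

Dividing by sqrt(d) keeps the scores from growing with the hidden size, which would otherwise push the softmax toward a near one-hot distribution and shrink its gradients.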

Installation

The full conda environment is exported in autosubs_env.yml. To install it, run: conda env create -f autosubs_env.yml

Training and Inference

Set your configuration in src/config.yaml.
- All hyperparameters and general execution parameters are exposed there for tuning.
Start training by running: python runner.py
Start inference by running: python inference.py

Data

Datasets belong in the data folder.
For KNNW, the expected files are:
- knnw-720p.tar.gz
- knnw_en_mono.wav
- knnw_en_sub.srt
- knnw_en.log_spectrogram.npy
- knnw_en_sub.csv
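The file list pairs a subtitle file (knnw_en_sub.srt) with a CSV (knnw_en_sub.csv). How the CSV was produced is not documented here, so the following is only a sketch of one way to turn SRT cues into (start, end, text) rows, assuming the standard SubRip layout. The names parse_srt and to_seconds are hypothetical helpers, not part of autosubs.

```python
import re

# Matches an SRT timing line such as "00:01:02,500 --> 00:01:04,000".
TIMING = re.compile(
    r"(\d+):(\d{2}):(\d{2})[,.](\d{3})\s*-->\s*(\d+):(\d{2}):(\d{2})[,.](\d{3})"
)


def to_seconds(hours, minutes, seconds, millis):
    """Convert the four captured timestamp fields to seconds."""
    return int(hours) * 3600 + int(minutes) * 60 + int(seconds) + int(millis) / 1000


def parse_srt(text):
    """Return a list of (start_seconds, end_seconds, caption) tuples."""
    cues = []
    # Cues are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = [line.strip() for line in block.splitlines() if line.strip()]
        for i, line in enumerate(lines):
            match = TIMING.match(line)
            if match:
                fields = match.groups()
                start = to_seconds(*fields[:4])
                end = to_seconds(*fields[4:])
                # Everything after the timing line is caption text.
                caption = " ".join(lines[i + 1:])
                cues.append((start, end, caption))
                break
    return cues


sample = """1
00:00:01,000 --> 00:00:03,500
Hello there.

2
00:00:04,250 --> 00:00:06,000
A second line
that wraps.
"""
cues = parse_srt(sample)
```

Each tuple could then be written out with the standard csv module, or used to slice the matching time range out of the audio or spectrogram.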

Checkpointing

Models can be trained or evaluated from a checkpoint by placing the weights in the checkpoints directory and setting the checkpoint path in config.yaml to the weights file.
