
SpeechCache: Speech Understanding on Tiny Devices with a Learning Cache

This repository contains the source code for the MobiSys'24 paper "Leveraging cache to enable SLU on tiny devices" by Afsara Benazir, Zhiming Xu, and Felix Xiaozhu Lin.

Dependencies

The code is compatible with Python 3.10. Dependencies are listed in requirements.txt and can be installed via pip install -r requirements.txt.
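For example, in a fresh virtual environment (a minimal sketch; the environment name is arbitrary):

python3.10 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt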

Training

The .csv files are located in SLURP/csv. They contain a detailed list of entries that combine columns from the original SLURP dataset with additional metadata.
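As a quick sanity check, the CSV files can be inspected with pandas (a minimal sketch; the filename train.csv is an assumption, substitute any file that actually exists under SLURP/csv):

import pandas as pd

# Peek at one of the CSV files; the filename below is hypothetical.
df = pd.read_csv("SLURP/csv/train.csv")
print(df.columns.tolist())  # columns combined from SLURP annotations and metadata
print(df.head())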

To train a sample model, run slurp_train.py as follows:

python3 slurp_train.py --model_dir <your_model_path> --wav_path <your_wav_path>

Inference/Testing

Each trained model comes with a metadata file ending in .pkl.gz that stores the indices of its training samples. During testing, those samples are omitted.
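To see which samples a model was trained on, the metadata can be loaded with the standard gzip and pickle modules (a minimal sketch; the filename and the exact structure of the pickled object are assumptions):

import gzip
import pickle

# Load the train-sample index stored alongside a trained model.
# The path below is hypothetical; point it at the .pkl.gz file in your model_dir.
with gzip.open("<your_model_path>/metadata.pkl.gz", "rb") as f:
    train_indices = pickle.load(f)

print(f"{len(train_indices)} training samples will be skipped during testing")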

To run inference, execute:

python3 slurp_test.py --model_dir <your_model_path> --wav_path <your_wav_path>

Optional: to experiment with the optimizations described in the paper, pass --dynamic True to run inference with a dynamic threshold (cf. p. 7 of the manuscript), and pass --in_domain True to use a pretrained SLURP model (cf. p. 7 of the manuscript).
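For example, both optimizations can be enabled in a single run (paths are placeholders; the double-dash spelling of --in_domain is inferred from the other flags):

python3 slurp_test.py --model_dir <your_model_path> --wav_path <your_wav_path> --dynamic True --in_domain True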

Models

All finetuned SLURP-C models used in the experiments can be found here

Models for the user study (in-the-wild evaluation) are here

Demo

demo_SC.mp4

Reference

@article{benazir2023leveraging,
  title={Leveraging cache to enable SLU on tiny devices},
  author={Benazir, Afsara and Xu, Zhiming and Lin, Felix Xiaozhu},
  journal={arXiv preprint arXiv:2311.18188},
  year={2023}
}

Please cite our paper if you find our work useful.

Acknowledgment

The code is adapted from end-to-end-SLU, released with the paper Speech Model Pre-training for End-to-End Spoken Language Understanding by Lugosch et al.
