
Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks

Thesis report

Download thesis report (PDF)

Errata

In the printed version of the report

  • Fig 5.1. The DS-Strided-24 result is missing
  • Fig 5.1. The no-information rate should be 11.5% instead of 10%; class imbalance was not taken into account
  • Fig 2.10. The labels EffNet and ShuffleNet are swapped
  • Fig 5.3. The model used is not described; it is Stride-DS-24
  • Table 4.1. Nesterov momentum is shown as NaN; it should be 0.9

Citing

You can use the following BibTeX entry:

@mastersthesis{esc_micro_cnn_nordby2019,
    title={Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks},
    author={Jon Nordby},
    year=2019,
    month=5,
    school={Norwegian University of Life Sciences},
    url={http://hdl.handle.net/11250/2611624}
}

Keywords

Wireless Sensor Networks, Embedded Systems
Edge Computing, Edge Machine Learning
Noise classification, Environmental Sound Classification (ESC), Urbansounds
Tensorflow, Keras, librosa

Abstract

Noise is a growing problem in urban areas, and according to the WHO it is the second largest environmental cause of health problems in Europe. Noise monitoring using Wireless Sensor Networks is being applied in order to understand and help mitigate these noise problems. It is desirable that these sensor systems, in addition to logging the sound level, can indicate what the likely sound source is. However, transmitting audio to a cloud system for classification is energy-intensive and may cause privacy issues. It is also critical for widespread adoption and dense sensor coverage that individual sensor nodes are low-cost. Therefore, we propose to perform the noise classification on the sensor node, using a low-cost microcontroller.

Several Convolutional Neural Networks were designed for the STM32L476 low-power microcontroller using the Keras deep-learning framework, and deployed using the vendor-provided X-CUBE-AI inference engine. The resource budget for the model was set at a maximum of 50% utilization of CPU, RAM, and FLASH. Ten model variations were evaluated on the Environmental Sound Classification task using the standard Urbansound8k dataset.

The best models used Depthwise-Separable convolutions with striding for downsampling, and were able to reach 70.9% mean 10-fold accuracy while consuming only 20% CPU. To our knowledge, this is the highest reported performance on Urbansound8k using a microcontroller. One of the models was also tested on a microcontroller development device, demonstrating the classification of environmental sounds in real-time.

These results indicate that it is computationally feasible to classify environmental sound on low-power microcontrollers. Further development should make it possible to create wireless sensor networks for noise monitoring with on-edge noise source classification.
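
For illustration, here is a minimal Keras sketch of a network using depthwise-separable convolutions with strided downsampling, in the spirit of the Stride-DS models described above. The input shape and layer sizes are illustrative assumptions, not the exact architectures evaluated in the thesis.

# Sketch only: illustrative layer sizes, not the thesis architectures
from tensorflow import keras
from tensorflow.keras import layers

def build_stride_ds_sketch(input_shape=(60, 31, 1), n_classes=10):
    # Input is a mel-spectrogram patch (mel bands x frames x 1 channel)
    inputs = keras.Input(shape=input_shape)
    # Strided convolutions perform downsampling instead of separate pooling layers
    x = layers.Conv2D(24, (5, 5), strides=(2, 2), padding='same', activation='relu')(inputs)
    x = layers.SeparableConv2D(24, (3, 3), strides=(2, 2), padding='same', activation='relu')(x)
    x = layers.SeparableConv2D(48, (3, 3), strides=(2, 2), padding='same', activation='relu')(x)
    x = layers.GlobalAveragePooling2D()(x)
    outputs = layers.Dense(n_classes, activation='softmax')(x)
    return keras.Model(inputs, outputs)

model = build_stride_ds_sketch()
model.summary()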

Run experiments

Setting up

Using miniconda to set up the Python environment is recommended

conda env create -f environment.yml
conda activate microesc

As an alternative, one can use pip

pip install -r requirements.txt

Preprocess audio files into features

python3 preprocess.py
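
The sketch below shows the kind of mel-spectrogram feature extraction typically done with librosa for Urbansound8k-style classification. The parameters (sample rate, number of mel bands, FFT size, hop length) and the helper name are assumptions for illustration and may differ from what preprocess.py actually uses.

# Sketch only: parameters are assumptions, not necessarily those of preprocess.py
import librosa
import numpy as np

def extract_mels(path, sr=22050, n_mels=60, n_fft=1024, hop_length=512):
    # Load audio at a fixed sample rate, compute a mel spectrogram,
    # and convert it to a log (dB) scale
    y, _ = librosa.load(path, sr=sr)
    mels = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels,
                                          n_fft=n_fft, hop_length=hop_length)
    return librosa.power_to_db(mels, ref=np.max)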

Check that the environment is working. This will run the training process, but only for a few minutes.

python3 jobs.py --check

Running

Train the models

python3 jobs.py

Evaluate the resulting models

python3 test.py

Plot the results

python3 report.py