BenchmarkSTT

About

This is a command-line tool for benchmarking Automatic Speech Recognition engines.

It is designed for non-academic production environments, and prioritises ease of use and relative benchmarking over scientific procedure and high-accuracy absolute scoring.

Because of the wide range of languages, algorithms and audio characteristics, no single STT engine can be expected to excel in all circumstances. For this reason, this tool places responsibility on the users to design their own benchmarking procedure and to decide, based on the combination of test data and metrics, which engine is best suited for their particular use case.

Usage examples

Returns the number of word insertions, deletions, replacements and matches for the hypothesis transcript compared to the reference:

benchmarkstt --reference reference.txt --hypothesis hypothesis.txt --diffcounts
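To make the four counts concrete, here is a minimal Python sketch that aligns two transcripts word by word with a standard edit-distance table and tallies the operations. It is an illustration only: the function name `diff_counts` and the dictionary keys are hypothetical, and the tool's own alignment algorithm and output format may differ.

```python
def diff_counts(reference: str, hypothesis: str) -> dict:
    """Count word-level matches, replacements, insertions and deletions.

    Illustrative sketch only (not the benchmarkstt implementation).
    """
    ref, hyp = reference.split(), hypothesis.split()
    n, m = len(ref), len(hyp)

    # cost[i][j] = minimum number of edits to turn ref[:i] into hyp[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diagonal = cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            cost[i][j] = min(diagonal, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Walk back from the bottom-right corner to recover one optimal alignment.
    counts = {"matches": 0, "replacements": 0, "insertions": 0, "deletions": 0}
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0
                and cost[i][j] == cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])):
            key = "matches" if ref[i - 1] == hyp[j - 1] else "replacements"
            counts[key] += 1
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            counts["deletions"] += 1  # word in reference, missing from hypothesis
            i -= 1
        else:
            counts["insertions"] += 1  # extra word in hypothesis
            j -= 1
    return counts


if __name__ == "__main__":
    print(diff_counts("the cat sat on the mat", "the cat sit on mat"))
```

When several alignments have the same minimum cost, the split between replacements, insertions and deletions can depend on the tie-breaking order, which is one reason two tools may report slightly different counts for the same pair of files.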

Returns the Word Error Rate after lowercasing both reference and hypothesis. This normalization improves the accuracy of the Word Error Rate, as it removes differences in letter case that would otherwise be counted as errors:

benchmarkstt -r reference.txt -h hypothesis.txt --wer --lowercase
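The effect of lowercasing can be seen in a small Python sketch that uses the common textbook definition of Word Error Rate: word-level edit distance divided by the number of reference words. The function name `word_error_rate` and its `lowercase` parameter are hypothetical, and the tool's own calculation may differ in detail.

```python
def word_error_rate(reference: str, hypothesis: str, lowercase: bool = False) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative sketch only (not the benchmarkstt implementation).
    """
    if lowercase:
        reference, hypothesis = reference.lower(), hypothesis.lower()
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edits needed to turn the reference words seen so far into hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            curr.append(min(
                prev[j - 1] + (ref_word != hyp_word),  # match or replacement
                prev[j] + 1,                           # deletion
                curr[j - 1] + 1,                       # insertion
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    reference, hypothesis = "The cat sat", "the cat sat"
    print(word_error_rate(reference, hypothesis))                  # case-sensitive
    print(word_error_rate(reference, hypothesis, lowercase=True))  # after lowercasing
```

Without lowercasing, "The" and "the" count as a replacement even though the engine recognised the word correctly; after lowercasing the two transcripts are identical.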

Returns a visual diff after applying all the normalization rules specified in the config file:

benchmarkstt -r reference.txt -h hypothesis.txt --worddiffs --config conf
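As a rough idea of what a word-level visual diff looks like, the following Python sketch uses the standard library's `difflib.SequenceMatcher` and marks deleted words as `[-word-]` and inserted words as `{+word+}`. The function name `word_diff` and the marker style are assumptions for illustration; the tool's own rendering and the format of its config file are not shown here.

```python
import difflib


def word_diff(reference: str, hypothesis: str) -> str:
    """Render a word-level diff with [-deleted-] and {+inserted+} markers.

    Illustrative sketch only (not the benchmarkstt implementation).
    """
    ref, hyp = reference.split(), hypothesis.split()
    matcher = difflib.SequenceMatcher(a=ref, b=hyp, autojunk=False)

    parts = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            parts.extend(ref[i1:i2])
        if tag in ("delete", "replace"):
            # words present in the reference but not in the hypothesis
            parts.append("[-" + " ".join(ref[i1:i2]) + "-]")
        if tag in ("insert", "replace"):
            # words present in the hypothesis but not in the reference
            parts.append("{+" + " ".join(hyp[j1:j2]) + "+}")
    return " ".join(parts)


if __name__ == "__main__":
    print(word_diff("the cat sat on the mat", "the dog sat on mat"))
```

Any normalization would be applied to both texts before this comparison, so that only the differences that matter for the use case show up in the diff.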

Further information

This is a collaborative project to create a library for benchmarking AI/ML applications. It was created in response to the needs of broadcasters and providers of Access Services to media organisations, but anyone is welcome to contribute. The group behind this project is the EBU's AI and Metadata group.

Currently the group is focussing on Speech-to-Text, but it will consider creating benchmarking tools for other AI/ML services.

For general information about this project, including the motivations and guiding principles, please see the project wiki.

To install and start using the tool, go to the documentation.
