Familiarization with Text Recognition using Recurrent Neural Networks. The accuracy of the network is not state of the art; the main objective was to build a minimal-size network that still gives reasonable results. The final version has about 85K parameters.
Images from the generated dataset. Only the vanilla MNIST dataset was used for generation.
Ground truth strings were encoded following the rules established in this work:
- Each digit is given the prefix "s_"
- Empty space is encoded with the symbol "-"
- Space between numbers is encoded as "|"
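The rules above can be sketched as a small encoding function. This is an illustrative assumption about how the tokens are produced (the function name and the exact tokenization are not from the repo); the "-" symbol is treated here as padding added elsewhere in the pipeline, not by the encoder itself.

```python
def encode_label(text: str) -> list[str]:
    """Encode a ground-truth string of digits and spaces into tokens.

    Assumed rules (see the list above):
      - each digit is prefixed with "s_"
      - a space between numbers becomes the token "|"
    The empty-space symbol "-" is assumed to be appended as padding
    by the training pipeline rather than by this function.
    """
    tokens = []
    for ch in text:
        if ch == " ":
            tokens.append("|")
        elif ch.isdigit():
            tokens.append(f"s_{ch}")
        else:
            raise ValueError(f"unsupported character: {ch!r}")
    return tokens
```

For example, `encode_label("12 3")` would yield `["s_1", "s_2", "|", "s_3"]` under these assumptions.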
Predictions of the network on the test dataset are saved to an HTML file using a Jinja2 template. Here is an example of the saved file from the Results directory:
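Saving predictions through Jinja2 might look like the following minimal sketch; the template string, record fields, and output filename here are assumptions for illustration, not the repo's actual template.

```python
from jinja2 import Template

# Illustrative template: one table row per (ground truth, prediction) pair.
TEMPLATE = Template("""\
<html><body><table>
<tr><th>Ground truth</th><th>Prediction</th></tr>
{% for row in rows %}
<tr><td>{{ row.gt }}</td><td>{{ row.pred }}</td></tr>
{% endfor %}
</table></body></html>
""")

def save_report(rows, path="results.html"):
    """Render ground-truth/prediction pairs into an HTML report file."""
    with open(path, "w") as f:
        f.write(TEMPLATE.render(rows=rows))
```

Each `row` can be a dict with `gt` and `pred` keys, since Jinja2 falls back to item lookup when attribute access fails.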