python, pytorch, captcha recognize, cnn This is learning project to understand how machine learning works. Aim - create and tech network to do text on picture recognition.
Whole process split by few steps:
- We should grab/download some images to use them as train/validate set, please use
grabber.py
as an entry point, inside file you can tweak how much data do you need - Your downloaded pictures should be described in the
desc.json
where you put picture name and exact text it contains - Run
image_transform.py
to do image normalization, it will remove picture noise, convert to black/white and crop to appropriate size - Run
neuro_image.py
to actually start teaching your network (pay attention, currently network storing commented out) - Run
model.py
to load pre-trained network and try to recognize real picture
Didn't try to make it as a finally solution, so all files probably requires minimal tweaks.