LSTM: Training - Image not trainable #590

Shreeshrii · 2016-12-19T12:47:54Z

mkdir -p ~/tesstutorial/sanvedic
lstmtraining -U ~/tesstutorial/vedic/san.unicharset
--script_dir ../langdata --debug_interval 0
--learning_rate 10e-5
--net_spec '[1,0,0,1 Ct5,5,16 Mp3,3 Lfys64 Lfx128 Lrx128 Lfx384 O1c5000]'
--net_mode 192
--perfect_sample_delay 19
--model_output ~/tesstutorial/sanvedic/base
--train_listfile ~/tesstutorial/vedic/san.training_files.txt
--eval_listfile /tesstutorial/vedic/san.training_files.txt
--max_iterations 50000
&>/tesstutorial/sanvedic/basetrain.log

Setting unichar properties
Setting properties for script Common
Setting properties for script Latin
Setting properties for script Devanagari
Unichar 2306=र्त्स्न्ये->र्त्स्न्ये is too long to encode!!
Warning: given outputs 5000 not equal to unicharset of 5018.
Num outputs,weights in serial:
1,0,0,1:1, 0
Num outputs,weights in serial:
C5,5:25, 0
Ft16:16, 416
Total weights = 416
[C5,5Ft16]:16, 416
Mp3,3:16, 0
Lfys64:64, 20736
Lfx128:128, 98816
Lrx128:128, 131584
Lfx384:384, 787968
Fc5018:5018, 1931930
Total weights = 2971450
Built network:[1,0,0,1[C5,5Ft16]Mp3,3Lfys64Lfx128Lrx128Lfx384Fc5018] from request [1,0,0,1 Ct5,5,16 Mp3,3 Lfys64 Lfx128 Lrx128 Lfx384 O1c5000]
Training parameters:
Debug interval = 0, weights = 0.1, learning rate = 0.0001, momentum=0.9
Loaded 828/828 pages (0-828) of document /home/shree/tesstutorial/vedic/san.AA_NAGARI_SHREE_L1.exp0.lstmf
Loaded 691/691 pages (0-691) of document /home/shree/tesstutorial/saneval/san.Aksharyogini2.exp0.lstmf
Loaded 1023/1023 pages (0-1023) of document /home/shree/tesstutorial/vedic/san.Sanskrit_2003.exp0.lstmf
Loaded 957/957 pages (0-957) of document /home/shree/tesstutorial/vedic/san.e-Nagari_OT.exp0.lstmf
Loaded 1060/1060 pages (0-1060) of document /home/shree/tesstutorial/vedic/san.FreeSans.exp0.lstmf
Loaded 691/691 pages (0-691) of document /home/shree/tesstutorial/saneval/san.Amiko.exp0.lstmf
Loaded 1213/1213 pages (0-1213) of document /home/shree/tesstutorial/vedic/san.Siddhanta-cakravat.exp0.lstmf
Loaded 1191/1191 pages (0-1191) of document /home/shree/tesstutorial/vedic/san.Sahadeva.exp0.lstmf
Loaded 1291/1291 pages (0-1291) of document /home/shree/tesstutorial/vedic/san.Santipur_OT_Medium.exp0.lstmf
Loaded 1115/1115 pages (0-1115) of document /home/shree/tesstutorial/vedic/san.Lohit_Devanagari.exp0.lstmf
Loaded 1210/1210 pages (0-1210) of document /home/shree/tesstutorial/vedic/san.Nakula.exp0.lstmf
Found AVX
Found SSE
Loaded 1188/1188 pages (0-1188) of document /home/shree/tesstutorial/vedic/san.Siddhanta-Calcutta.exp0.lstmf
Loaded 1211/1211 pages (0-1211) of document /home/shree/tesstutorial/vedic/san.Siddhanta.exp0.lstmf
Loaded 1214/1214 pages (0-1214) of document /home/shree/tesstutorial/vedic/san.Siddhanta-Nepali.exp0.lstmf
Loaded 1157/1157 pages (0-1157) of document /home/shree/tesstutorial/vedic/san.Uttara.exp0.lstmf
Image too large to learn!! Size = 2594x48
Image not trainable
Image too large to learn!! Size = 2758x48
Image not trainable
Image too large to learn!! Size = 2621x48
Image not trainable
At iteration 100/100/103, Mean rms=0.95%, delta=57.759%, char train=100.161%, word train=100%, skip ratio=3%, New worst char error = 100.161 wrote checkpoint

Shreeshrii · 2016-12-19T12:52:46Z

The images used were created by text2image with training text with word wrap which ran for full width of page.

Is there a limit to size of images for training?

Should training text only to be 70-120 characters wide?

Shreeshrii · 2017-01-09T08:44:53Z

This is the opposite case of image being too small.

Built network:[1,0,0,1[C5,5Ft16]Mp3,3Lfys64Lfx128Lrx128Lfx256Fc104] from request [1,0,0,1 Ct5,5,16 Mp3,3 Lfys64 Lfx128 Lrx128 Lfx256 O1c5000]
Training parameters:
  Debug interval = 0, weights = 0.1, learning rate = 0.0001, momentum=0.9
Loaded 151/151 pages (1-151) of document /home/shree/tesstutorial/trado/ara.Traditional_Arabic.exp0.lstmf
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
At iteration 100/100/104, Mean rms=6.004%, delta=48.481%, char train=138.814%, word train=100%, skip ratio=4%,  New worst char error = 138.814 wrote checkpoint.

Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
At iteration 200/200/207, Mean rms=5.654%, delta=40.983%, char train=119.407%, word train=100%, skip ratio=3.5%,  New worst char error = 119.407 wrote checkpoint.

Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!

amitdo · 2017-01-09T09:50:31Z

Is there a limit to size of images for training?

https://github.com/tesseract-ocr/tesseract/blob/ce76d1c569/lstm/lstmrecognizer.cpp#L266

// Maximum width of image to train on.
const int kMaxImageWidth = 2560;

Shreeshrii · 2017-01-09T09:58:13Z

Then shouldn't text2image ensure that images are made to fit that width. ShreeDevi

…

____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Mon, Jan 9, 2017 at 3:20 PM, Amit D. ***@***.***> wrote: Is there a limit to size of images for training? https://github.com/tesseract-ocr/tesseract/blob/ce76d1c569/ lstm/lstmrecognizer.cpp#L266 https://github.com/tesseract-ocr/tesseract/blob/ce76d1c569/ lstm/lstmrecognizer.cpp#L266 // Maximum width of image to train on. const int kMaxImageWidth = 2560; — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#590 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AE2_oyLDWu_QZFaYM9Kn1mCaW7ExTo-_ks5rQgLtgaJpZM4LQsPF> .

amitdo · 2017-01-09T10:00:28Z

Yes :-)

Shreeshrii · 2017-01-09T10:01:47Z

https://github.com/tesseract-ocr/tesseract/blob/831e161066d28a0320d7061c8403f638515b8801/training/text2image.cpp#L82 // Width of output image (in pixels). INT_PARAM_FLAG(xsize, 3600, "Width of output image");

Shreeshrii · 2017-01-17T13:25:15Z

The default value for images output by text2image can be reduced during running tesstrain.sh by modifying tesstrain_utils.sh

    common_args+=" --leading=${LEADING} --xsize 2550"

Shreeshrii · 2017-03-15T06:54:40Z

@theraysmith

Ray,

// Maximum width of image to train on.
const int kMaxImageWidth = 2560;

I have some old tif/box pairs . the image width is 4000.

Will training quality be degraded if changing above constant to 4000 in order to use them?

Shreeshrii · 2017-03-15T06:55:50Z

Also can this be changed during runtime with a variable or do I need to recompile tesseract with the higher value?

Shreeshrii · 2017-05-11T08:51:18Z

Changing tesstrain_utils.sh for

common_args+=" --leading=${LEADING} --xsize 2550"
fixes this.

hanikh · 2017-08-08T06:48:38Z

@Shreeshrii how can the problem of image being too small be fixed?

Shreeshrii · 2017-08-09T03:31:42Z

Usually this happens for just a few lines of an image - tesseract splits the input image into separate image per line.

It could be when layout analysis has wrongly segmented the page or a line has been detected as having hundreds of diacritics.

If it is just a few messages, you could ignore.

@theraysmith Any update regarding new line detection algorithm?

hanikh · 2017-08-09T04:00:39Z

actually, it's not just a few messages. I am trying to train tesseract to recognize plate licence, and the prepared training_text is just like a plate licence. something like this: ۵۴ ۷۲۸ ب ۱۴ each line includes one of these patterns. I received a lot of these errors and the training process finished with error rate equal to zero. no training! would you please help me to figure out what the problem is?

…

On Wed, Aug 9, 2017 at 8:02 AM, Shreeshrii ***@***.***> wrote: Reopened #590 <#590>. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#590 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AZFiARloL1SxhhVagWDBpNPsl8wmxGH3ks5sWSgzgaJpZM4LQsPF> .

amitdo · 2017-08-09T05:27:43Z

Image too large to learn!! Size = 2758x48
Image not trainable

@hanikh, please paste a short example for the errors you get.

theraysmith · 2017-08-10T18:46:52Z

The exact error message would greatly help diagnose the problem.

…

On Tue, Aug 8, 2017 at 10:28 PM, Amit D. ***@***.***> wrote: Image too large to learn!! Size = 2758x48 Image not trainable @hanikh <https://github.com/hanikh>, please paste a short example for the errors you get. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#590 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AL056TBM3518EXdJE7-KA44mvwgN2Mx2ks5sWUNhgaJpZM4LQsPF> .

-- Ray.

hanikh · 2017-08-12T08:15:26Z

I will send the exact error message as soon as possible. but, meanwhile I have faced a more important problem. I finetuned tesseract for farsi (40 fonts on 6000 text lines) and I got worse result than the original tesserct on the trained fonts. what is the problem? the training_text is not big enough? (this is a different project and not related to the licence plate) On Thu, Aug 10, 2017 at 11:17 PM, theraysmith <notifications@github.com> wrote:

…

The exact error message would greatly help diagnose the problem. On Tue, Aug 8, 2017 at 10:28 PM, Amit D. ***@***.***> wrote: > Image too large to learn!! Size = 2758x48 > Image not trainable > > @hanikh <https://github.com/hanikh>, please paste a short example for the > errors you get. > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > <#590# issuecomment-321156352>, > or mute the thread > <https://github.com/notifications/unsubscribe-auth/AL056TBM3518EXdJE7- KA44mvwgN2Mx2ks5sWUNhgaJpZM4LQsPF> > . > -- Ray. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#590 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AZFiAQuqzKOKd8bmnzUcFlsc6bPQth3Oks5sW1AzgaJpZM4LQsPF> .

roozgar · 2017-08-12T08:30:53Z

@hanikh
did you used v4?
i saw this problem on cube for persian..

hanikh · 2017-08-12T09:57:22Z

@theraysmith would you please help me, how many text line is appropriate?
thanks

Shreeshrii · 2017-08-12T10:35:04Z

I finetuned tesseract for farsi (40 fonts on 6000 text lines)

I think this maybe too much for finetuning. I noticed that tesstrain.sh is limiting text2image generated images to just 3 pages - that would be only max 150 lines per font. With that much input, you can try replace a layer training to see if that gets you better results. ShreeDevi

…

____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Sat, Aug 12, 2017 at 3:27 PM, hanikh ***@***.***> wrote: @theraysmith <https://github.com/theraysmith> would you please help me, how many text line is appropriate? thanks — You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <#590 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AE2_o4rV-DPLTiSAqgSTy9dJdA3Oek6iks5sXXcJgaJpZM4LQsPF> .

Shreeshrii · 2017-08-12T11:58:06Z

@hanikh I suggest to wait till Ray updates the langdata and also uploads the new version of unichar_extractor. Befroe that training for RTL languages may not be give useful results. ShreeDevi

…

____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Sat, Aug 12, 2017 at 4:04 PM, ShreeDevi Kumar <shreeshrii@gmail.com> wrote:

> I finetuned tesseract for farsi (40 fonts on 6000 text lines) I think this maybe too much for finetuning. I noticed that tesstrain.sh is limiting text2image generated images to just 3 pages - that would be only max 150 lines per font. With that much input, you can try replace a layer training to see if that gets you better results. ShreeDevi ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Sat, Aug 12, 2017 at 3:27 PM, hanikh ***@***.***> wrote: > @theraysmith <https://github.com/theraysmith> would you please help me, > how many text line is appropriate? > thanks > > — > You are receiving this because you modified the open/close state. > Reply to this email directly, view it on GitHub > <#590 (comment)>, > or mute the thread > <https://github.com/notifications/unsubscribe-auth/AE2_o4rV-DPLTiSAqgSTy9dJdA3Oek6iks5sXXcJgaJpZM4LQsPF> > . >

hanikh · 2017-08-12T12:18:47Z

Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Compute CTC targets failed!
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
Image too small to scale!! (3x48 vs min width of 3)
Line cannot be recognized!!
Image not trainable
2 Percent improvement time=0, best error was 2.167 @ 14
At iteration 14/1100/20884, Mean rms=0.049%, delta=0%, char train=0%, word train=0%, skip ratio=1798.6%, New best char error = 0 wrote best model:/home/fanasa/tesstutorial/fastuned_from_fas/fastuned-plates0_14.lstm wrote checkpoint.

Finished! Error rate = 0
this is the error I got during training for licence plates.

theraysmith · 2017-08-13T04:14:29Z

Initial problem: (Image too small to scale) Those images are ridiculously small at 3x48 pixels. Something is going wrong somewhere with the images. Are they oriented vertically? The input scaling scales the height to 48, whatever it starts as, so it looks like your textlines are vertical. Fine tuning problem: The problem is most likely too many iterations. It will hone its accuracy to whatever training data you give it if you run it for too many iterations. See how few iterations are used in the training tutorial for fine tuning.

…

On Sat, Aug 12, 2017 at 5:19 AM, hanikh ***@***.***> wrote: Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Compute CTC targets failed! Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable Image too small to scale!! (3x48 vs min width of 3) Line cannot be recognized!! Image not trainable 2 Percent improvement time=0, best error was 2.167 @ 14 At iteration 14/1100/20884, Mean rms=0.049%, delta=0%, char train=0%, word train=0%, skip ratio=1798.6%, New best char error = 0 wrote best model:/home/fanasa/tesstutorial/fastuned_from_fas/fastuned-plates0_14.lstm wrote checkpoint. Finished! Error rate = 0 this is the error I got during training for licence plates. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#590 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AL056ZvLnyg_aC1mUg2gH34puAGpWdOOks5sXZhHgaJpZM4LQsPF> .

-- Ray.

Shreeshrii · 2017-08-13T05:07:49Z

Ray, I have seen line too small to be recognized when building box/tiff pairs using tesstrain.sh - it is usually related to 'nnn diacritics found' - so it may be related to accents being treated as a separate line. Regarding finetuning, I have experimented a lot with Devanagari - with smaller number of iterations, the reported error rate is higher. And it takes tens of thosands of iterations for it to get more accuracy on training set - not sure of its effect on samples it has not seen. - see https://github.com/Shreeshrii/tess4training/blob/master/README.md ShreeDevi

…

____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Sun, Aug 13, 2017 at 9:44 AM, theraysmith <notifications@github.com> wrote:

Initial problem: (Image too small to scale) Those images are ridiculously small at 3x48 pixels. Something is going wrong somewhere with the images. Are they oriented vertically? The input scaling scales the height to 48, whatever it starts as, so it looks like your textlines are vertical. Fine tuning problem: The problem is most likely too many iterations. It will hone its accuracy to whatever training data you give it if you run it for too many iterations. See how few iterations are used in the training tutorial for fine tuning. On Sat, Aug 12, 2017 at 5:19 AM, hanikh ***@***.***> wrote: > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Compute CTC targets failed! > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > Image too small to scale!! (3x48 vs min width of 3) > Line cannot be recognized!! > Image not trainable > 2 Percent improvement time=0, best error was 2.167 @ 14 > At iteration 14/1100/20884, Mean rms=0.049%, delta=0%, char train=0%, word > train=0%, skip ratio=1798.6%, New best char error = 0 wrote best > model:/home/fanasa/tesstutorial/fastuned_from_ fas/fastuned-plates0_14.lstm > wrote checkpoint. > > Finished! Error rate = 0 > this is the error I got during training for licence plates. > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > <#590# issuecomment-321977639>, > or mute the thread > <https://github.com/notifications/unsubscribe-auth/AL056ZvLnyg_ aC1mUg2gH34puAGpWdOOks5sXZhHgaJpZM4LQsPF> > . > -- Ray. — You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <#590 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AE2_o3ztjvMQKBue5JIqMU9Qrfx4ng_Mks5sXng2gaJpZM4LQsPF> .

hanikh · 2017-08-14T07:11:55Z

for the fine tuning problem: the error-rate reaches 0.017 at about 80000 iterations. so with few iterations like in tutorial, a low error-rate like 0.01 can not be achieved. so you think fine tuning is a wrong solution and I should try replacing some layers? as I said before I am trying to train for 40 Persian fonts and they are so common. On Sun, Aug 13, 2017 at 9:38 AM, Shreeshrii <notifications@github.com> wrote:

…

Ray, I have seen line too small to be recognized when building box/tiff pairs using tesstrain.sh - it is usually related to 'nnn diacritics found' - so it may be related to accents being treated as a separate line. Regarding finetuning, I have experimented a lot with Devanagari - with smaller number of iterations, the reported error rate is higher. And it takes tens of thosands of iterations for it to get more accuracy on training set - not sure of its effect on samples it has not seen. - see https://github.com/Shreeshrii/tess4training/blob/master/README.md ShreeDevi ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Sun, Aug 13, 2017 at 9:44 AM, theraysmith ***@***.***> wrote: > Initial problem: (Image too small to scale) > Those images are ridiculously small at 3x48 pixels. Something is going > wrong somewhere with the images. > Are they oriented vertically? The input scaling scales the height to 48, > whatever it starts as, so it looks like your textlines are vertical. > > Fine tuning problem: > The problem is most likely too many iterations. It will hone its accuracy > to whatever training data you give it if you run it for too many > iterations. > See how few iterations are used in the training tutorial for fine tuning. > > On Sat, Aug 12, 2017 at 5:19 AM, hanikh ***@***.***> wrote: > > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Compute CTC targets failed! > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > Image too small to scale!! (3x48 vs min width of 3) > > Line cannot be recognized!! > > Image not trainable > > 2 Percent improvement time=0, best error was 2.167 @ 14 > > At iteration 14/1100/20884, Mean rms=0.049%, delta=0%, char train=0%, > word > > train=0%, skip ratio=1798.6%, New best char error = 0 wrote best > > model:/home/fanasa/tesstutorial/fastuned_from_ > fas/fastuned-plates0_14.lstm > > wrote checkpoint. > > > > Finished! Error rate = 0 > > this is the error I got during training for licence plates. > > > > — > > You are receiving this because you were mentioned. > > Reply to this email directly, view it on GitHub > > <#590# > issuecomment-321977639>, > > or mute the thread > > <https://github.com/notifications/unsubscribe-auth/AL056ZvLnyg_ > aC1mUg2gH34puAGpWdOOks5sXZhHgaJpZM4LQsPF> > > . > > > > > > -- > Ray. > > — > You are receiving this because you modified the open/close state. > Reply to this email directly, view it on GitHub > <#590# issuecomment-322020794>, > or mute the thread > <https://github.com/notifications/unsubscribe-auth/AE2_ o3ztjvMQKBue5JIqMU9Qrfx4ng_Mks5sXng2gaJpZM4LQsPF> > . > — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#590 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AZFiAZCIts02B7U5JsRtn2DYu86ZBuyhks5sXoTKgaJpZM4LQsPF> .

hanikh · 2017-08-16T06:31:00Z

@Shreeshrii would you please explain about the new traineddata file? where can the lang.lstm-unicharset file be found ? how can combine_lang_model be used? thanks On Mon, Aug 14, 2017 at 11:44 AM, Hanieh Khosravi <hani.khosravi@gmail.com> wrote:

…

for the fine tuning problem: the error-rate reaches 0.017 at about 80000 iterations. so with few iterations like in tutorial, a low error-rate like 0.01 can not be achieved. so you think fine tuning is a wrong solution and I should try replacing some layers? as I said before I am trying to train for 40 Persian fonts and they are so common. On Sun, Aug 13, 2017 at 9:38 AM, Shreeshrii ***@***.***> wrote: > Ray, > > I have seen line too small to be recognized when building box/tiff pairs > using tesstrain.sh - it is usually related to 'nnn diacritics found' - so > it may be related to accents being treated as a separate line. > > Regarding finetuning, I have experimented a lot with Devanagari - with > smaller number of iterations, the reported error rate is higher. And it > takes tens of thosands of iterations for it to get more accuracy on > training set - not sure of its effect on samples it has not seen. - see > https://github.com/Shreeshrii/tess4training/blob/master/README.md > > > > ShreeDevi > ____________________________________________________________ > भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com > > On Sun, Aug 13, 2017 at 9:44 AM, theraysmith ***@***.***> > wrote: > > > > Initial problem: (Image too small to scale) > > Those images are ridiculously small at 3x48 pixels. Something is going > > wrong somewhere with the images. > > Are they oriented vertically? The input scaling scales the height to 48, > > whatever it starts as, so it looks like your textlines are vertical. > > > > Fine tuning problem: > > The problem is most likely too many iterations. It will hone its > accuracy > > to whatever training data you give it if you run it for too many > > iterations. > > See how few iterations are used in the training tutorial for fine > tuning. > > > > On Sat, Aug 12, 2017 at 5:19 AM, hanikh ***@***.***> > wrote: > > > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Compute CTC targets failed! > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > Image too small to scale!! (3x48 vs min width of 3) > > > Line cannot be recognized!! > > > Image not trainable > > > 2 Percent improvement time=0, best error was 2.167 @ 14 > > > At iteration 14/1100/20884, Mean rms=0.049%, delta=0%, char train=0%, > > word > > > train=0%, skip ratio=1798.6%, New best char error = 0 wrote best > > > model:/home/fanasa/tesstutorial/fastuned_from_ > > fas/fastuned-plates0_14.lstm > > > wrote checkpoint. > > > > > > Finished! Error rate = 0 > > > this is the error I got during training for licence plates. > > > > > > — > > > You are receiving this because you were mentioned. > > > Reply to this email directly, view it on GitHub > > > <#590# > > issuecomment-321977639>, > > > or mute the thread > > > <https://github.com/notifications/unsubscribe-auth/AL056ZvLnyg_ > > aC1mUg2gH34puAGpWdOOks5sXZhHgaJpZM4LQsPF> > > > . > > > > > > > > > > > -- > > Ray. > > > > — > > You are receiving this because you modified the open/close state. > > Reply to this email directly, view it on GitHub > > <#590 (comment) > comment-322020794>, > > or mute the thread > > <https://github.com/notifications/unsubscribe-auth/AE2_o3ztj > vMQKBue5JIqMU9Qrfx4ng_Mks5sXng2gaJpZM4LQsPF> > > > . > > > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > <#590 (comment)>, > or mute the thread > <https://github.com/notifications/unsubscribe-auth/AZFiAZCIts02B7U5JsRtn2DYu86ZBuyhks5sXoTKgaJpZM4LQsPF> > . >

Shreeshrii · 2017-08-16T07:06:09Z

where can the lang.lstm-unicharset file be found ?

combine_tessdata -u lang.traineddata lang.

It will create lang.* files , including the unicharset.

You can use dawg2wordlist to see the wordlist used

how can combine_lang_model be used?

 combine_lang_model    \
 --input_unicharset  ../tesstutorial/sanskrit2003/san/san.unicharset  \
 --script_dir "../langdata"   \
 --words "../langdata/san/san.wordlist" \
 --numbers "../langdata/san/san.numbers"   \
 --puncs "../langdata/san/san.punc" \
 --output_dir ../tesstutorial/sanskrit2003   \
 --lang "san"     --pass_through_recoder \
     --version_str "4.0.0alpha-20170816 sanskrit2003"

For RTL languages, there is an additional flag. Please see https://github.com/tesseract-ocr/tesseract/blob/master/training/tesstrain_utils.sh for details.

I used a hand-edited unicharset, because the unicharset generated from the current training process is old style. You should wait for @theraysmith to update the unichar_extractor and other langdata files.

ghost · 2018-07-07T01:33:27Z

@Shreeshrii

I haven't had much success in finetuning.

Give me examples of things you failed to do with finetuning.
Also a sample of the training text that you chosen to use in training.

Fixes Image too large to learn!! Size = 2594x48 Image not trainable See tesseract-ocr#590 (comment) for related discussion

Shreeshrii · 2018-08-29T18:10:08Z

#590 (comment) by Ray Smith

Initial problem: (Image too small to scale)
Those images are ridiculously small at 3x48 pixels. Something is going
wrong somewhere with the images.
Are they oriented vertically? The input scaling scales the height to 48,
whatever it starts as, so it looks like your textlines are vertical.

This bug is still there.

Shreeshrii · 2018-09-22T20:58:46Z


Error in pixScaleAreaMap: pixd too small
Error in pixClone: pixs not defined
Error in pixCopyText: pixd not defined
Error in pixCopyInputFormat: pixd not defined
Scaling pix of size 35, 4548 by factor 0.0105541 made null pix!!
Error in pixGetWidth: pix not defined
Error in pixGetHeight: pix not defined
Bad pix from ImageData!
Line cannot be recognized!!
Image not trainable

with version

tesseract 4.0.0-beta.4-158-g02f9d
 leptonica-1.76.0
  libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8 : libwebp 0.4.4 : libopenjp2 2.3.0

anonynamja · 2018-09-28T13:30:09Z

I had the same problem as the thread OP:

Image too large to learn!! Size = 2594x48 Image not trainable

I resolved it with this suggestion above #590 (comment)

Changing tesstrain_utils.sh for

common_args+=" --leading=${LEADING} --xsize 2550"
fixes this.

Was this the correct approach?

msklvsk · 2018-12-29T17:40:37Z

Image too large to learn!! hasn’t gone. You get it with a small enough font or with 48-pixel-tall input layer even using --xsize 2550.

Image too large to learn!! Size = 4859x48

So the question is why does this constraint exist and whether it can be dropped or set to, say, 6000? Or should one prepare shorter lines after all? What would be the correct solution?

Shreeshrii · 2019-04-10T08:09:16Z

Similar to #590 (comment)

Error in pixScaleAreaMap: pixd too small
Error in pixClone: pixs not defined
Error in pixCopyText: pixd not defined
Error in pixCopyInputFormat: pixd not defined
Scaling pix of size 35, 4477 by factor 0.0107215 made null pix!!
Error in pixGetWidth: pix not defined
Error in pixGetHeight: pix not defined
Bad pix from ImageData!
Line cannot be recognized!!
Image not trainable

tesseract 4.1.0-rc1-255-g332a1
leptonica-1.76.0
libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8 : libwebp 0.4.4 : libopenjp2 2.3.0

kamrapooja · 2019-06-19T07:52:15Z

Hi Shree,

I am also getting same error
"Image too large to learn!! Size = 2881x36
Image not trainable"
Max size is not defined in tesstrain.sh. And as per document default size is 3600.
Why this issue is coming?

Mohamed209 · 2019-11-06T11:09:21Z

does anyone know what is the recommended image size which bounding boxes are extracted from it to retrain tesseract , if so shall i retrain with fixed sizes or with variety of images sizes

lnutimura · 2020-03-06T19:31:54Z

@Shreeshrii

I'm also experiencing this same error while trying to fine tune an existing model:

[...]
Loaded 1/1 lines (1-1) of document data/bar-ground-truth/test-0-049.exp0.lstmf
Image too large to learn!! Size = 3316x48
Image not trainable
Loaded 1/1 lines (1-1) of document data/bar-ground-truth/test-1-026.exp0.lstmf
Image too large to learn!! Size = 3316x48
[...]

As suggested in #590, I modified tesstrain_utils.sh by changing the X_SIZE variable but it didn't help.

Also, after sorting the .tif files by dimension in descending order, I noticed that the first three files aren't even that large:

In fact, none of my images have a width of 3316 px.
I tried to resize them w/ ImageMagick but it didn't help as well.

Why is tesseract getting these different values for dimensions?

Shreeshrii · 2020-03-07T02:25:01Z

Please upload a sample lstmf file which is getting the error for checking.

…

On Sat, Mar 7, 2020, 01:01 Luan Utimura ***@***.***> wrote: @Shreeshrii <https://github.com/Shreeshrii> I'm also experiencing this same error while trying to fine tune an existing model: [...] Loaded 1/1 lines (1-1) of document data/bar-ground-truth/test-0-049.exp0.lstmf Image too large to learn!! Size = 3316x48 Image not trainable Loaded 1/1 lines (1-1) of document data/bar-ground-truth/test-1-026.exp0.lstmf Image too large to learn!! Size = 3316x48 [...] As suggested in #590 <#590 (comment)>, I modified tesstrain_utils.sh by changing the X_SIZE variable but it didn't help. Also, after sorting the .tif files by dimension in descending order, I noticed that the first three files aren't even that large: [image: files] <https://user-images.githubusercontent.com/10110243/76115228-85bada00-5fc6-11ea-910a-10a2857f90b5.png> In fact, none of my images have a width of 3316 px. I tried to resize them w/ ImageMagick but it didn't help as well. Why is tesseract reading these values for dimensions? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#590?email_source=notifications&email_token=ABG37IZZDVJS3ZPIOFFFOODRGFFSZA5CNFSM4C2CYPC2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEOCRTAQ#issuecomment-595925378>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABG37I4VFECHBC5BNUEF6V3RGFFSZANCNFSM4C2CYPCQ> .

lnutimura · 2020-03-07T03:13:25Z

This one, for example:
bar-0-032.exp0.lstmf.zip

Thanks for replying.

Shreeshrii · 2020-03-07T04:45:31Z

@lnutimura Thanks for the lstmf file.

I unpacked it using an experimental feature by @stweil to check the image file in it. You are right, the image size is 2351x32.

I think the image is being resized for 48 height as part of training and that is increasing its width to 3627 and leading to the error. I had thought that the resized image maybe kept in lstmf file, but that is not the case.

Please take a look at the network spec that you are using for training. Usually the image height is either 36 or 48 in them. e.g. from https://tesseract-ocr.github.io/tessdoc/Data-Files-in-tessdata_fast

Version string:4.00.00alpha:amh:synth20170629
LSTM training info:Network str:[1,36,0,1Ct3,3,16Mp3,3Lfys48Lfx96Lrx96Lfx192O1c1], flags=41,
iteration=6112200, sample_iteration=6112270, null_char=284, learning_rate=0.001, momentum=0.5, adam_beta=0.999

Version string:4.00.00alpha:Arabic:synth20170629:[1,48,0,1Ct3,3,16Mp3,3Lfys64Lfx96Lrx96Lfx128O1c1]
LSTM training info:Network str:[1,48,0,1Ct3,3,16Mp3,3Lfys64Lfx96Lrx96Lfx128O1c1], flags=41,
iteration=5524000, sample_iteration=5532770, null_char=2, learning_rate=0.001, momentum=0.5, adam_beta=0.999

@stweil, @amitdo Suggestions for fixing this??

Shreeshrii · 2020-03-07T04:57:10Z

From https://tesseract-ocr.github.io/tessdoc/VGSLSpecs.html

Model string input and output
A neural network model is described by a string that describes the input spec, the output spec and the layers spec in between. Example:

[1,0,0,3 Ct5,5,16 Mp3,3 Lfys64 Lfx128 Lrx128 Lfx256 O1c105]

The first 4 numbers specify the size and type of the input, and follow the TensorFlow convention for an image tensor: [batch, height, width, depth]. Batch is currently ignored, but eventually may be used to indicate a training mini-batch size. Height and/or width may be zero, allowing them to be variable. A non-zero value for height and/or width means that all input images are expected to be of that size, and will be bent to fit if needed. Depth needs to be 1 for greyscale and 3 for color. As a special case, a different value of depth, and a height of 1 causes the image to be treated from input as a sequence of vertical pixel strips. NOTE THAT THROUGHOUT, x and y are REVERSED from conventional mathematics, to use the same convention as TensorFlow.

lnutimura · 2020-03-07T04:58:05Z

I'm using the por.traineddata from tessdata_best, so I guess it's:

Version string:4.00.00alpha:por:synth20170629
LSTM training info:Network str:[1,36,0,1Ct3,3,16Mp3,3Lfys48Lfx96Lrx96Lfx192O1c1], flags=41,
iteration=1850300, sample_iteration=1850422, null_char=118, learning_rate=0.001, momentum=0.5, adam_beta=0.999

Shreeshrii · 2020-03-07T05:01:37Z

The network spec for tessdata_best is not the same as that for tessdata_fast. I don't think we have the info for all tessdata_best languages.

Shreeshrii · 2020-03-07T05:06:39Z

Also see #590 (comment) by Ray

The input scaling scales the height to 48,
whatever it starts as,

lnutimura · 2020-03-07T05:12:21Z

Oh, I see. That makes sense now.
Do you think there's a way to bypass this?
I'm generating a lot of images for each .tif using a script you provided in this issue, so manually changing them would be a little hard-working, but "doable"...

Shreeshrii · 2020-03-07T06:48:03Z

Are your original images 2 column, or facing pages of book? If so it will be helpful to split them before generating line images.

lnutimura · 2020-03-07T16:09:01Z

They're tables that occupy the entire space of the page.
I'll group some of the columns and generate separate files for them before running the script. Let's see if it works better. Anyway, thanks for the help!

EDIT: I was able to finish the training without any error! Now it's just a matter of finding ways to improve the fine tuning.

ciobania · 2020-04-03T01:13:39Z

Hi there,

I'm having the same error, with 1 tif, 1000 iterations, however, the lstmtraining keeps running.
I'm running on a Jetson Nano, using:

tesseract 4.1.1-rc2-21-gf4ef
 leptonica-1.78.0
  libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.2) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0
 Found libarchive 3.2.2 zlib/1.2.11 liblzma/5.2.2 bz2lib/1.0.6 liblz4/1.7.1

I'm training on a single image, just to understand the mechanism, and learn about it.
I'm using a scanned receipt, as an example, 600dpi. Identity, and imagemagick says it's 1696x3930.

I'm confused a bit by this, as the script still runs, and the error rate keeps dropping.
I've read the tutorials and examples, and the scripts, and it's all too much for now, as I've been at it for about 2-3 weeks now.

Do I need to create single line images from each image I have? (~3000)
would it help if I create ground-truth text files - for the entire image, only for a single line?
some of the words in my images are not found in the eng.training_files.txt, as such would it speed up/help if I add them?
is there a way to do fine tuning with my own images and my own eng.training_files.txt data, without running tesstrain.sh?

I could not find details about how to train/fina tune with own tif/box. It's unclear to me if I need to generate the ground-truth data as well, do I still need to fiddle/fix the box files, etc.

Sorry if I asked too many questions, I've invested so much time in it, and I'm not sure where exactly to these questions fit - forum, new issue, Google Group?

Later edit:
Funny enough, the combined new model seems to be worse than the best trained one, probably because of the image resolution/error.

akmalkadi · 2021-03-27T17:27:15Z

I am getting:
Image too large to learn!! Size = 4024x36

Even though all my images are 1900x17.

Shreeshrii · 2021-03-28T02:15:44Z

All images are resized to 36 or 48 pixels height based on network spec used. So looks like your resized image maybe too big.

Shreeshrii changed the title ~~LSTM: Training - Image too large to learn~~ LSTM: Training - Image not trainable Jan 9, 2017

Shreeshrii closed this as completed May 11, 2017

Shreeshrii reopened this Aug 9, 2017

Shreeshrii mentioned this issue Aug 16, 2017

Format of train_listfile #841

Closed

noahmetzger pushed a commit to noahmetzger/tesseract that referenced this issue Jul 31, 2018

Change default width for images output by text2image

9504556

Fixes Image too large to learn!! Size = 2594x48 Image not trainable See tesseract-ocr#590 (comment) for related discussion

Shreeshrii mentioned this issue Aug 30, 2018

text2image - wrong box files for short lines #1883

Open

anonynamja mentioned this issue Sep 25, 2018

image too large from Langdata_lstm #1922

Closed

amitdo mentioned this issue Dec 31, 2019

kMaxImageWidth = 2560，but xsize is 3600 (default) #2837

Closed

ShroukMansour mentioned this issue Feb 23, 2020

Image too small to scale!! (1x48 vs min width of 3) Shreeshrii/tess5train-fonts#11

Closed

davidb1 mentioned this issue Jun 2, 2020

Image too small to scale!! (2x48 vs min width of 3) - vertical lstm training #3001

Closed

stweil mentioned this issue Jun 25, 2020

ImageData::PreScale gives unnecessary errors if pixScale fails #3025

Closed

amitdo added the training label Mar 18, 2021

LSTM: Training - Image not trainable #590

LSTM: Training - Image not trainable #590

Comments

Shreeshrii commented Dec 19, 2016

Shreeshrii commented Dec 19, 2016

Shreeshrii commented Jan 9, 2017

amitdo commented Jan 9, 2017 • edited Loading

Shreeshrii commented Jan 9, 2017 via email

amitdo commented Jan 9, 2017

Shreeshrii commented Jan 9, 2017 via email

Shreeshrii commented Jan 17, 2017 • edited Loading

Shreeshrii commented Mar 15, 2017

Shreeshrii commented Mar 15, 2017

Shreeshrii commented May 11, 2017

hanikh commented Aug 8, 2017

Shreeshrii commented Aug 9, 2017

hanikh commented Aug 9, 2017 via email

amitdo commented Aug 9, 2017

theraysmith commented Aug 10, 2017 via email

hanikh commented Aug 12, 2017 via email

roozgar commented Aug 12, 2017

hanikh commented Aug 12, 2017

Shreeshrii commented Aug 12, 2017 via email

Shreeshrii commented Aug 12, 2017 via email

hanikh commented Aug 12, 2017

theraysmith commented Aug 13, 2017 via email

Shreeshrii commented Aug 13, 2017 via email

hanikh commented Aug 14, 2017 via email

hanikh commented Aug 16, 2017 via email

Shreeshrii commented Aug 16, 2017 • edited Loading

ghost commented Jul 7, 2018

Shreeshrii commented Aug 29, 2018

Shreeshrii commented Sep 22, 2018

anonynamja commented Sep 28, 2018

msklvsk commented Dec 29, 2018

Shreeshrii commented Apr 10, 2019

kamrapooja commented Jun 19, 2019

Mohamed209 commented Nov 6, 2019

lnutimura commented Mar 6, 2020 • edited Loading

Shreeshrii commented Mar 7, 2020 via email

lnutimura commented Mar 7, 2020

Shreeshrii commented Mar 7, 2020

Shreeshrii commented Mar 7, 2020

lnutimura commented Mar 7, 2020

Shreeshrii commented Mar 7, 2020 • edited Loading

Shreeshrii commented Mar 7, 2020

lnutimura commented Mar 7, 2020

Shreeshrii commented Mar 7, 2020

lnutimura commented Mar 7, 2020 • edited Loading

ciobania commented Apr 3, 2020 • edited Loading

akmalkadi commented Mar 27, 2021

Shreeshrii commented Mar 28, 2021

amitdo commented Jan 9, 2017 •

edited

Loading

Shreeshrii commented Jan 17, 2017 •

edited

Loading

Shreeshrii commented Aug 16, 2017 •

edited

Loading

lnutimura commented Mar 6, 2020 •

edited

Loading

Shreeshrii commented Mar 7, 2020 •

edited

Loading

lnutimura commented Mar 7, 2020 •

edited

Loading

ciobania commented Apr 3, 2020 •

edited

Loading