[tesseract-ocr] Does the checkpoint_name contain the number of iterations

2023-09-20 Thread Des Bw
I couldn't understand what the numbers on the checkpoint_names are. I looked at this one: but clear to me. https://github.com/tesseract-ocr/tesseract/blob/3a7f5e4de459f4c64f36e08b18ce1b66b1fbc876/src/lstm/lstmtrainer.cpp#L410 -- You received this message because you are subscribed to the

Re: [tesseract-ocr] Does the checkpoint_name contain the number of iterations

2023-09-20 Thread Shree Devi Kumar
See https://tesseract-ocr.github.io/tessdoc/tess4/TrainingTesseract-4.00.html#iterations-and-checkpoints On Wed, Sep 20, 2023, 2:53 AM Des Bw wrote: > I couldn't understand what the numbers on the checkpoint_names are. > I looked at this one: but clear to me. > >

Re: [tesseract-ocr] Does the checkpoint_name contain the number of iterations

2023-09-20 Thread Des Bw
Thank you so much dear Shree. On Wednesday, September 20, 2023 at 4:57:52 PM UTC+3 shree wrote: > See > https://tesseract-ocr.github.io/tessdoc/tess4/TrainingTesseract-4.00.html#iterations-and-checkpoints > > On Wed, Sep 20, 2023, 2:53 AM Des Bw wrote: > >> I couldn't understand what the

[tesseract-ocr] Cutting the top layer is deteriorating the original training

2023-09-20 Thread Des Bw
The default traineddata for Amharic is pretty accurate except it misses a handful of characters. I have been emulating what Shree did to add the Norwegian Æ to the dataset. It actually worked like charm. The problem is: I cannot get nowhere near the accuracy of the original best model. -