You are training from scratch, which takes many thousands of iterations before the error rate comes down. Try fine-tuning an existing model instead.
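On the earlier question about the two characters listed in the quoted thread (U+200D ZERO WIDTH JOINER and U+0965 DEVANAGARI DOUBLE DANDA): before deciding anything, it may help to check how often they occur in the training text. The following is only a rough sketch in Python, not part of the Tesseract tooling. The helper names `count_suspects` and `strip_suspects` and the sample string are made up for illustration, and whether stripping is the right choice for your data is something you would need to judge yourself.

```python
import unicodedata

# The two code points named in this thread.
SUSPECT_CHARS = {
    "\u200d": "ZERO WIDTH JOINER",
    "\u0965": "DEVANAGARI DOUBLE DANDA",
}


def count_suspects(text):
    """Return {char: count} for each suspect character present in text."""
    return {ch: text.count(ch) for ch in SUSPECT_CHARS if ch in text}


def strip_suspects(text):
    """Return a copy of text with every suspect character removed."""
    return text.translate({ord(ch): None for ch in SUSPECT_CHARS})


if __name__ == "__main__":
    # Hypothetical sample line; replace with the contents of your training text.
    sample = "\u0915\u094d\u200d\u0937 \u0930\u093e\u092e\u0965"
    for ch, n in count_suspects(sample).items():
        utf8_hex = ch.encode("utf-8").hex(" ")
        print(f"U+{ord(ch):04X} {unicodedata.name(ch)} count={n} utf-8={utf8_hex}")
    print(strip_suspects(sample))
```

If you do change the text, the unicharset and lstmf files would have to be regenerated from the changed text so that all three stay consistent.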

On Thu, Apr 16, 2020, 19:51 Piyush Chandra wrote:

> Hi Shree,
>
> Thanks for replying.
>
> So shall I remove them from the text file and create a unicharset file after
> that, or do I have to do something while creating the lstmf files?
>
> Also, will this affect the training if I don't remove them? I saw that
> training was continuing, but the best char error was 100 even after 5000
> iterations and went to 96 after 7800 iterations. Weird. :-\
>
> On Thursday, 16 April 2020 19:26:15 UTC+5:30, shree wrote:
>>
>> U+0965 ॥ e0 a5 a5 DEVANAGARI DOUBLE DANDA
>>
>> On Thu, Apr 16, 2020, 19:25 Shree Devi Kumar wrote:
>>
>>> U+200D e2 80 8d ZERO WIDTH JOINER

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [email protected].
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUqB5g%2Bzo-svYzjanZBwN6UE7j4kWTFkwQxG9ccM5REbg%40mail.gmail.com.

