Hi ShreeDevi, thanks.
I tested the two models that you provided. Accuracy was about 98% on samples without noise, but only about 80% on scanned samples or captured images. The models also still did not work on different fonts.

Could you send all the files needed for training the models? I want to fine-tune the model with more fonts and diacritics.

Best regards

On Friday, May 18, 2018 at 8:49:54 PM UTC+4:30, shree wrote:
>
> I have posted a couple of test models for Farsi at
> https://github.com/Shreeshrii/tessdata_shreetest
>
> These have not been trained on text with diacritics as the normalization
> and training process was giving errors on the combining marks.
>
> Please give them a try and see if they provide better recognition for
> numbers and text without combining marks.
>
> FYI, I do not know the Persian language so it is difficult for me to gauge
> if results are ok or not.
>
> ShreeDevi
>