Re: [tesseract-ocr] train more fonts on trained model fas in tesseract

reza Fri, 18 May 2018 20:54:49 -0700

hi ShreeDevi

Thanks.


I tested the 2 models that you have provided. The accuracy on samples 
without noise were about 98% but on scanned samples or captured images, 
were about 80%.
but still it didn't work on different fonts.
Could u send all files that needed for training models? I want fine tune 
the model with more fonts and diacritics .

best regards
 

On Friday, May 18, 2018 at 8:49:54 PM UTC+4:30, shree wrote:
>
> I have posted a couple of test models for Farsi at 
> https://github.com/Shreeshrii/tessdata_shreetest
>
> These have not been trained on text with diacritics as the normalization 
> and training process was giving error on the combining marks.
>
> Please give them a try and see if they provide better recognition for 
> numbers and text without combining marks.
>
> FYI, I do not know the Persian language so it is difficult for me to gauge 
> if results are ok or not.
>
> ShreeDevi
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/fe15cedc-0a2a-41fc-ac3c-b80df458a509%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Re: [tesseract-ocr] train more fonts on trained model fas in tesseract

Reply via email to