Training fonts > best practices

Ruran Fri, 21 Sep 2012 05:55:27 -0700


I'm training a Sinhala language pack for Tesseract.


Sinhala script includes modifiers, vowels, and consonants.

While training, we can follow one of two models.

1) We can treat the whole character (consonant plus modifier) as one unit (image 01):
<https://lh4.googleusercontent.com/-Rl8ABPjHgu0/UFxGe0Pk0WI/AAAAAAAAAHc/ubRRm-ekQI8/s1600/comb.png>



2) Or we can treat modifiers and consonants as separate units (image 02):
<https://lh6.googleusercontent.com/-ONNKry_7Vxc/UFxGieJc_mI/AAAAAAAAAHk/u5O4ZTFE4FY/s1600/sep.png>
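
To make the two models concrete, here is a minimal Python sketch of how the same word would be split into training units under each one. It is an illustration only and not part of Tesseract: the helper names `combined_units` and `separate_units` are hypothetical, the sample word is chosen only for illustration, and the grouping rule (attach every Unicode combining mark to the letter before it) is a simplifying assumption that ignores conjuncts joined with ZWJ.

```python
import unicodedata


def combined_units(text):
    """Model 1 (hypothetical helper): a base letter plus its trailing
    combining marks forms one unit."""
    units = []
    for ch in text:
        # Unicode general categories Mn/Mc/Me are combining marks;
        # Sinhala vowel signs and al-lakuna fall into these categories.
        if units and unicodedata.category(ch).startswith("M"):
            units[-1] += ch
        else:
            units.append(ch)
    return units


def separate_units(text):
    """Model 2 (hypothetical helper): every code point is its own unit."""
    return list(text)


# Sample word: ka + vowel sign i + ra + vowel sign i
word = "\u0D9A\u0DD2\u0DBB\u0DD2"

print(len(combined_units(word)), combined_units(word))
print(len(separate_units(word)), separate_units(word))
```

Under model 1 the sample word yields two units, each a consonant with its vowel sign; under model 2 it yields four units, one per code point. The number of distinct units to train therefore differs a lot between the two models.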



Both models might work, but which one is preferred for achieving
higher accuracy?



Regards,

Ruwanthaka

