I’m training Sinhala pack for the tesseract;
Sinhala includes Modifiers / Vowels and constant fonts.
While training we can flows two models.
<https://lh4.googleusercontent.com/-Rl8ABPjHgu0/UFxGe0Pk0WI/AAAAAAAAAHc/ubRRm-ekQI8/s1600/comb.png>1)
We can consider hole character as one (image 01)
<https://lh6.googleusercontent.com/-ONNKry_7Vxc/UFxGieJc_mI/AAAAAAAAAHk/u5O4ZTFE4FY/s1600/sep.png>2)
or Modifiers and constant separates (image 02)
Both models might work, but to gaining higher accuracy which is the
best/prefer model?
regards
Ruwanthaka
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en