On Monday, December 9, 2013 3:07:49 AM UTC-6, Vamsee wrote: > > Hi, > > I'm using jTessBoxEditor to train tesseract in ubuntu. > > Problem1. > jTessBoxEditor is not able to detecting boxes from > vamsi.urwpalladiolbi.exp01.tif but from vamsi.urwpalladiolbi.exp0.tif it > can detect the boxes.Both of them are same file format but different > resolution. > Does Resolution matters for detecting the boxes? >
Yes. For a given image, the box editor uses Tesseract executable to recognize the image in creating the box file. A low-resolution image will present problems for Tesseract. > > Problem2: > How can we train the file with different fonts? or is it > possible only with UTF-8 chracters(text document font format). > Training Wiki states: *DO NOT MIX FONTS IN AN IMAGE FILE* (In a single .tr file to be precise.) This will cause features to be dropped at clustering, which leads to recognition errors. -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

