On Monday, December 9, 2013 3:07:49 AM UTC-6, Vamsee wrote:
>
> Hi,
>
> I'm using jTessBoxEditor to train Tesseract on Ubuntu.
>
> Problem1:
>           jTessBoxEditor is not able to detect boxes in 
> vamsi.urwpalladiolbi.exp01.tif, but it can detect them in 
> vamsi.urwpalladiolbi.exp0.tif. Both files are the same format but have 
> different resolutions.
> Does resolution matter for detecting the boxes?
>

Yes. For a given image, the box editor runs the Tesseract executable to 
recognize the image when creating the box file, so a low-resolution image 
that Tesseract recognizes poorly will also yield poor or missing boxes.
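
As a rough illustration, here is a small Python sketch that works out the 
pixel size an image would need after resampling to a target resolution. The 
300 DPI default is an assumption on my part (a commonly suggested figure for 
OCR input, not something jTessBoxEditor enforces), and the function name and 
the sample numbers are made up for the example.

```python
def scale_to_target_dpi(width_px, height_px, current_dpi, target_dpi=300):
    """Return the (width, height) in pixels after resampling to target_dpi.

    target_dpi=300 is an assumed default, not a value taken from the tools.
    """
    if current_dpi <= 0:
        raise ValueError("current_dpi must be positive")
    # Pixel dimensions grow linearly with the resolution ratio.
    factor = target_dpi / current_dpi
    return round(width_px * factor), round(height_px * factor)


# Hypothetical 100 DPI scan of 850 x 1100 pixels.
print(scale_to_target_dpi(850, 1100, 100))  # (2550, 3300)
```

Upsampling a low-resolution image does not add detail, so rescanning or 
re-rendering the training page at a higher resolution is usually the better 
fix when that is possible.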

>
> Problem2:
>             How can we train with different fonts? Or is it 
> possible only with UTF-8 characters (text document font format)?
>

The Training Wiki states: *DO NOT MIX FONTS IN AN IMAGE FILE* (in a single 
.tr file, to be precise). Mixing them causes features to be dropped at 
clustering, which leads to recognition errors. To train several fonts, give 
each font its own image file.
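
To make the one-font-per-file idea concrete, here is a minimal Python sketch 
that builds one Tesseract 3.x "box.train" command per font, following the 
lang.fontname.expN naming already used in your files. The helper name is 
hypothetical and "dejavusans" is only a placeholder second font; the sketch 
just builds the argument lists and does not run anything.

```python
def box_train_commands(lang, fonts, exp_num=0):
    """Build one 'tesseract ... box.train' command per font.

    Each font gets its own image, so each run produces its own .tr file.
    """
    commands = []
    for font in fonts:
        base = f"{lang}.{font}.exp{exp_num}"
        # Tesseract 3.x training form: tesseract <image> <output base> box.train
        commands.append(["tesseract", f"{base}.tif", base, "box.train"])
    return commands


# "dejavusans" is a placeholder font name for illustration only.
for cmd in box_train_commands("vamsi", ["urwpalladiolbi", "dejavusans"]):
    print(" ".join(cmd))
```

Each command expects the matching .box file next to the image. The resulting 
.tr files, one per font, are then passed together to the later clustering 
steps.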

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
