I have a problem training Tesseract with a single-character image.
An example of the problem is described below.
The box and TIFF files are attached.
Box: https://docs.google.com/file/d/0Bz99K1Qj2HQ_dkZKUW5RdDU1Tk0/edit
Tif: https://docs.google.com/file/d/0Bz99K1Qj2HQ_WkJqOHI0OHU3Nnc/edit
 
Case 1:
===command===
tesseract test.ming.24.tif test.ming.24 batch.nochop box.train
 
===output message===
Tesseract Open Source OCR Engine v3.02 with Leptonica
Empty page!!
Empty page!!
 
Case 2: Telling Tesseract that the image contains a single character
===command===
.\tesseract test.ming.24.tif test.ming.24 -psm 10 batch.nochop box.train
 
===output message===
Tesseract Open Source OCR Engine v3.02 with Leptonica
Bounding box=(16,23)->(28,32)
Bounding box=(16,15)->(28,24)
APPLY_BOXES: boxfile line 0/??((8,14),(36,41)): FAILURE! Couldn't find a matching blob
APPLY_BOXES:
   Boxes read from boxfile: 1
   Boxes failed resegmentation: 1
APPLY_BOXES: Unlabelled word at :Bounding box=(16,15)->(28,32)
APPLY_BOXES: Unlabelled word at :Bounding box=(8,14)->(36,41)
   Found 0 good blobs.
   2 remaining unlabelled words deleted.
Generated training data for 0 words
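
To make the numbers in the Case 2 log easier to compare, here is a rough Python sketch of the box arithmetic. It assumes the log prints boxes as (left,bottom)->(right,top), the same order the box file uses; the helper names `union` and `margins` are made up for illustration and are not part of Tesseract. It only shows how the box-file box relates to the two blobs that were found. It does not explain why the match is rejected.

```python
from typing import Tuple

# A box is (left, bottom, right, top); values are copied from the Case 2 log.
Box = Tuple[int, int, int, int]


def union(a: Box, b: Box) -> Box:
    """Smallest box that encloses both a and b."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def margins(outer: Box, inner: Box) -> Box:
    """How far outer extends past inner on the left, bottom, right and top."""
    return (
        inner[0] - outer[0],
        inner[1] - outer[1],
        outer[2] - inner[2],
        outer[3] - inner[3],
    )


box_file = (8, 14, 36, 41)   # the single entry read from the box file
blob_a = (16, 23, 28, 32)    # first "Bounding box" line in the log
blob_b = (16, 15, 28, 24)    # second "Bounding box" line in the log

merged = union(blob_a, blob_b)
print(merged)                     # (16, 15, 28, 32)
print(margins(box_file, merged))  # (8, 1, 8, 9)
```

The merged box equals the first "Unlabelled word" box in the log, and the box-file entry extends past it by 8, 1, 8 and 9 pixels on the left, bottom, right and top.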
 
Are there any options that need to be specified to make this work?
 
Thanks a lot.
 
Regards,
W. K. Lo

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en
