I have problem using tesseract in training using character image. Examples of the problem is described as follows. Box and Tif files are attached. Box: https://docs.google.com/file/d/0Bz99K1Qj2HQ_dkZKUW5RdDU1Tk0/edit Tif: https://docs.google.com/file/d/0Bz99K1Qj2HQ_WkJqOHI0OHU3Nnc/edit Case 1: ===command=== tesseract test.ming.24.tif test.ming.24 batch.nochop box.train ===output message=== Tesseract Open Source OCR Engine v3.02 with Leptonica Empty page!! Empty page!! Case 2: Telling Tesseract there is only one single character ===command=== .\tesseract test.ming.24.tif test.ming.24 -psm 10 batch.nochop box.train ===output message=== Tesseract Open Source OCR Engine v3.02 with Leptonica Bounding box=(16,23)->(28,32) Bounding box=(16,15)->(28,24) APPLY_BOXES: boxfile line 0/??((8,14),(36,41)): FAILURE! Couldn't find a matchin g blob APPLY_BOXES: Boxes read from boxfile: 1 Boxes failed resegmentation: 1 APPLY_BOXES: Unlabelled word at :Bounding box=(16,15)->(28,32) APPLY_BOXES: Unlabelled word at :Bounding box=(8,14)->(36,41) Found 0 good blobs. 2 remaining unlabelled words deleted. Generated training data for 0 words Any options needed to be specified to make it work? Thanks a lot. Regards, W. K. Lo
-- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

