Hello. I've created a box file and then edited. When I create the .tr file I've got the following messages " APPLY_BOXES: Unlabelled word at :Bounding box"
Tesseract Open Source OCR Engine v3.02 with Leptonica row xheight=11, but median xheight = 9.5 row xheight=24, but median xheight = 9.5 row xheight=20.5, but median xheight = 9.5 row xheight=20, but median xheight = 9.5 row xheight=20.5, but median xheight = 9.5 row xheight=31.3333, but median xheight = 9.5 row xheight=104, but median xheight = 9.5 row xheight=36.6667, but median xheight = 9.5 row xheight=19, but median xheight = 9.5 row xheight=19, but median xheight = 9.5 row xheight=19.2222, but median xheight = 9.5 row xheight=18, but median xheight = 9.5 row xheight=19.2222, but median xheight = 9.5 row xheight=19.2222, but median xheight = 9.5 row xheight=30, but median xheight = 9.5 row xheight=30, but median xheight = 9.5 row xheight=149.333, but median xheight = 9.5 row xheight=17, but median xheight = 9.5 row xheight=19, but median xheight = 9.5 row xheight=26, but median xheight = 9.5 row xheight=32.6667, but median xheight = 9.5 row xheight=63.3333, but median xheight = 9.5 FAIL! APPLY_BOXES: boxfile line 16/Í ((171,405),(174,428)): FAILURE! Couldn't find a matching blob APPLY_BOXES: Boxes read from boxfile: 99 Boxes failed resegmentation: 1 APPLY_BOXES: Unlabelled word at :Bounding box=(633,657)->(670,699) APPLY_BOXES: Unlabelled word at :Bounding box=(874,658)->(914,694) APPLY_BOXES: Unlabelled word at :Bounding box=(926,656)->(929,667) APPLY_BOXES: Unlabelled word at :Bounding box=(938,661)->(949,684) APPLY_BOXES: Unlabelled word at :Bounding box=(959,656)->(962,667) APPLY_BOXES: Unlabelled word at :Bounding box=(983,667)->(997,690) APPLY_BOXES: Unlabelled word at :Bounding box=(283,580)->(479,627) APPLY_BOXES: Unlabelled word at :Bounding box=(492,585)->(536,610) APPLY_BOXES: Unlabelled word at :Bounding box=(547,585)->(695,610) APPLY_BOXES: Unlabelled word at :Bounding box=(834,542)->(898,615) APPLY_BOXES: Unlabelled word at :Bounding box=(916,615)->(919,617) APPLY_BOXES: Unlabelled word at :Bounding box=(293,539)->(430,567) APPLY_BOXES: Unlabelled word at :Bounding box=(440,539)->(479,567) APPLY_BOXES: Unlabelled word at :Bounding box=(490,540)->(640,572) APPLY_BOXES: Unlabelled word at :Bounding box=(646,547)->(685,569) APPLY_BOXES: Unlabelled word at :Bounding box=(918,548)->(920,571) APPLY_BOXES: Unlabelled word at :Bounding box=(304,515)->(346,536) APPLY_BOXES: Unlabelled word at :Bounding box=(356,514)->(498,538) APPLY_BOXES: Unlabelled word at :Bounding box=(508,516)->(671,538) APPLY_BOXES: Unlabelled word at :Bounding box=(917,501)->(921,545) APPLY_BOXES: Unlabelled word at :Bounding box=(918,477)->(922,498) APPLY_BOXES: Unlabelled word at :Bounding box=(105,590)->(918,636) APPLY_BOXES: Unlabelled word at :Bounding box=(126,458)->(264,613) APPLY_BOXES: Unlabelled word at :Bounding box=(104,463)->(107,517) APPLY_BOXES: Unlabelled word at :Bounding box=(28,376)->(45,397) APPLY_BOXES: Unlabelled word at :Bounding box=(3,344)->(14,354) APPLY_BOXES: Unlabelled word at :Bounding box=(102,342)->(104,373) APPLY_BOXES: Unlabelled word at :Bounding box=(103,419)->(105,459) APPLY_BOXES: Unlabelled word at :Bounding box=(103,386)->(105,416) APPLY_BOXES: Unlabelled word at :Bounding box=(714,242)->(892,492) APPLY_BOXES: Unlabelled word at :Bounding box=(101,273)->(103,279) APPLY_BOXES: Unlabelled word at :Bounding box=(101,254)->(103,264) APPLY_BOXES: Unlabelled word at :Bounding box=(712,242)->(732,260) APPLY_BOXES: Unlabelled word at :Bounding box=(860,250)->(863,255) APPLY_BOXES: Unlabelled word at :Bounding box=(101,145)->(337,243) APPLY_BOXES: Unlabelled word at :Bounding box=(686,211)->(897,227) APPLY_BOXES: Unlabelled word at :Bounding box=(926,146)->(932,259) APPLY_BOXES: Unlabelled word at :Bounding box=(833,183)->(837,184) APPLY_BOXES: Unlabelled word at :Bounding box=(693,147)->(759,184) APPLY_BOXES: Unlabelled word at :Bounding box=(773,147)->(893,197) APPLY_BOXES: Unlabelled word at :Bounding box=(958,38)->(1022,86) APPLY_BOXES: Unlabelled word at :Bounding box=(0,0)->(1024,94) Found 98 good blobs. Leaving 20 unlabelled blobs in 0 words. 42 remaining unlabelled words deleted. TRAINING ... Font name = monospaced.exp100 Generated training data for 15 words So, after the editing I've generated a training data for 15 words. If I do not edit the file it generates a training data for 48 words. Any idea why this is happening? Thanks. I've attached the edited and not edited box files -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/780b66d4-5f3b-44d5-8bae-984d3da7f3a0%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
c.monospaced.exp100.box
Description: Binary data
c_unedited.monospaced.exp100.box
Description: Binary data

