Hi all!

Find below the log from generating a tr file:

====
Page 0
APPLY_BOXES:
   Boxes read from boxfile: 3312
   Boxes failed resegmentation: 0
   Found 3312 good blobs and 3 unlabelled blobs in 0 words.
   0 remaining unlabelled words deleted.
TRAINING ... Font name = TAMKambanNarrow
Generated training data for 220 words
Page 1
APPLY_BOXES:
   Boxes read from boxfile: 3312
   Boxes failed resegmentation: 0
   Found 3312 good blobs and 3 unlabelled blobs in 0 words.
   0 remaining unlabelled words deleted.
Generated training data for 232 words
============
Normally I get "0 unlabelled blobs in 0 words", and if I deliberately delete any boxes I get "nn boxes in 0 words". In this particular tif and box file pair, however, all of the originally generated boxes are labelled (either individually or after merging or splitting), so no blob should be left unlabelled. I went through the box/tif file using jTessBoxEditor but could not locate any unlabelled blobs.

Is there a way to write the box coordinates of the unlabelled blobs to the log file, so that I can check definitively that all boxes are covered?

Regards,
rnkantan
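
One way to cross-check the box file outside the training log is to list its coordinates directly. The following is only a minimal sketch, not a Tesseract feature: it assumes the usual box file layout of one box per line as `symbol left bottom right top page` (with the page field possibly absent in older files), and the function names `parse_box_lines`, `degenerate_boxes` and `dump_boxes` are made up for illustration. It does not identify which blobs Tesseract considers unlabelled; it only prints what the box file contains and flags boxes with zero or negative width or height.

```python
from collections import defaultdict


def parse_box_lines(lines):
    """Group boxes by page: {page: [(symbol, left, bottom, right, top), ...]}.

    Assumes whitespace-separated fields 'symbol left bottom right top [page]'.
    Lines with any other number of fields are skipped.
    """
    pages = defaultdict(list)
    for raw in lines:
        fields = raw.split()
        if len(fields) == 6:
            symbol, left, bottom, right, top, page = fields
        elif len(fields) == 5:
            # Assumption: a file without a page column describes page 0 only.
            symbol, left, bottom, right, top = fields
            page = "0"
        else:
            continue
        box = (symbol, int(left), int(bottom), int(right), int(top))
        pages[int(page)].append(box)
    return pages


def degenerate_boxes(pages):
    """Return (page, box) pairs whose width or height is not positive."""
    bad = []
    for page in sorted(pages):
        for box in pages[page]:
            _, left, bottom, right, top = box
            if right <= left or top <= bottom:
                bad.append((page, box))
    return bad


def dump_boxes(path):
    """Print per-page box counts, every box, and any degenerate boxes."""
    with open(path, encoding="utf-8") as handle:
        pages = parse_box_lines(handle)
    for page in sorted(pages):
        print(f"Page {page}: {len(pages[page])} boxes")
        for symbol, left, bottom, right, top in pages[page]:
            print(f"  {symbol}\t{left} {bottom} {right} {top}")
    for page, box in degenerate_boxes(pages):
        print(f"Degenerate box on page {page}: {box}")
```

Calling `dump_boxes` with the path to the box file prints one line per box, so the per-page counts can be compared against the "Boxes read from boxfile" figures in the log above.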

