On Wed, Nov 28, 2012 at 07:49:34PM -0800, Joe Carter wrote: > When it comes to punctuation, does the same apply? I know the training guide > says to make sure that the punctuation is not grouped together, but do the > examples of punctuation have to be plausible? For example, do parentheses have > to be properly matched? e.g. The quick (brown fox} jump over the lazy dog.
To answer this bit specifically, no, grouping of characters doesn't matter here. Rules of punctuation should be specified in the punc-dawg file, if you need them. The tif/box step is all about character shapes, so as long as they're laid out realistically enough that relative size, spacings etc are well accounted for, you'll be fine. Nick -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

