On Wed, Apr 25, 2012 at 11:10 AM, Lucas Swartsenburg <[email protected]> wrote:
> Excuse me, the correct term would be: segmentation. So this sentence would
> be segmented in:
>
> "So", "this", "sentence", "would", "be", "in". (all of these are images
> of the handwritten words).
>

I am not sure I understood your point correctly, but have a look at [1]. I attached there an example of how to split an input image into symbols. You can use tesseract::RIL_WORD instead of tesseract::RIL_SYMBOL if you need to split the image into words (the other possibilities are listed in [2]).

[1] https://groups.google.com/group/tesseract-ocr/msg/e0a5d02702cdac21
[2] http://code.google.com/p/tesseract-ocr/source/browse/trunk/ccstruct/publictypes.h?r=716#185

--
Zdenko

