ocropus might be better, since it does page layout detection and stuff. -_Sven
On Fri, Jan 4, 2013 at 11:25 AM, tesseract newbie <[email protected]>wrote: > Hi there, > > I apologize if this has been asked already, I wasn't able to find anything > like this. > > I have a large bunch of handwritten pages that have been scanned. > > I know that tesseract isn't good for handwriting, so don't want to attempt > this. > > I would like to split the page into little images. Each image should > contain a single word. No OCR should be attempted. > > I tesseract a tool that can do this? If not, any pointers to other open > source packages will be appreciated. > > Thanks! > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- ``All that is gold does not glitter, not all those who wander are lost; the old that is strong does not wither, deep roots are not reached by the frost. >From the ashes a fire shall be woken, a light from the shadows shall spring; renewed shall be blade that was broken, the crownless again shall be king.” -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

