Hi Nathan,

I adapted the Tesseract API examples (https://code.google.com/p/tesseract-ocr/wiki/APIExample) to save each text line as a separate image for feeding into another OCR program: https://github.com/ryanfb/tesslinesplit

If you change RIL_TEXTLINE to RIL_WORD in tesslinesplit.cpp and recompile, it should split an image into words instead.
Best,
-Ryan

On Sunday, September 27, 2015 at 12:38:58 PM UTC-4, Nathan Cain wrote:
> I have a project similar to recaptcha where I need humans to type words
> instead of computer ocr. Is there a way for tesseract to split an image
> into words and output the words as separate image files?

