On 11 July 2010 13:32, sj08 <[email protected]> wrote: > I was wondering if I could pipe a screendump from xwdtotiff or some > other program straight to tesseract and then pipe the lot into one > long text file. (Or have tesseract append the text to a file) > I have a large number (4000 plus) of image slides (actually > individual .swf files) I want to grab the text off. Is this possible? > I don't really want to spend the next 6mths typing. :-[ >
I don't see why not, as long as the text is clear enough to be recognised. FWIW, there are a number of tools out there for extracting various content items (such as images) from .swf files - it might be worth your while to investigate those, as the images might be better quality than a screendump (not to mention that it would probably be quicker than generating screendumps). -- <Leftmost> jimregan, that's because deep inside you, you are evil. <Leftmost> Also not-so-deep inside you. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

