Hi again. After installing tesseract, I ran it against TIFF files. Unfortunately, the text is not being recognized.
The TIFF files were produced by converting PNG images (yellow background, red font) in two ways:

1) With Java ImageIO: boolean b = ImageIO.write(image, "tiff", fileName); Running tesseract against this type of file produces an empty output file.
2) With the Image Converter .EXE utility on Windows. Tesseract churns out the following text:

\\\\\\\\\\\\\\\\\\\\\HHHHHHHHHHHH\\\\\\\\\\\\\\\\\UU\\\\\\\\\\\\\\\H\W

Feeding tesseract the eurotext.tif sample file, on the other hand, produces perfect output. Could anyone suggest possible reasons for the failure? Maybe the background and text blend together, or special care should be taken when converting PNG to TIFF?

Thanks
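One thing that might be worth trying is to reduce the image to plain black and white before writing the TIFF, so the red-on-yellow colors cannot get in the way. The sketch below is only an illustration under assumptions, not a confirmed fix: the class name TiffPrep, the fixed threshold of 128, and the command-line arguments are all made up for this example, and it assumes a TIFF writer is registered with ImageIO (built in since Java 9, an extra plugin on older versions). It also checks the boolean returned by ImageIO.write, since a false result means no file was written properly.

```java
import java.awt.image.BufferedImage;
import java.io.File;
import java.io.IOException;
import javax.imageio.ImageIO;

// Hypothetical helper, named for this example only.
final class TiffPrep {

    // Map each pixel to black or white using a fixed luminance threshold.
    // Pixels darker than the threshold become black, all others white.
    static BufferedImage binarize(BufferedImage src, int threshold) {
        BufferedImage out = new BufferedImage(
                src.getWidth(), src.getHeight(), BufferedImage.TYPE_BYTE_BINARY);
        for (int y = 0; y < src.getHeight(); y++) {
            for (int x = 0; x < src.getWidth(); x++) {
                int rgb = src.getRGB(x, y);
                int r = (rgb >> 16) & 0xFF;
                int g = (rgb >> 8) & 0xFF;
                int b = rgb & 0xFF;
                // Integer approximation of the Rec. 601 luma weights.
                int luma = (299 * r + 587 * g + 114 * b) / 1000;
                out.setRGB(x, y, luma < threshold ? 0xFF000000 : 0xFFFFFFFF);
            }
        }
        return out;
    }

    // Usage (paths are placeholders): java TiffPrep input.png output.tif
    public static void main(String[] args) throws IOException {
        BufferedImage src = ImageIO.read(new File(args[0]));
        if (src == null) {
            throw new IOException("No image reader found for " + args[0]);
        }
        // The threshold of 128 is an assumption; tune it for other color pairs.
        boolean written = ImageIO.write(binarize(src, 128), "tiff", new File(args[1]));
        if (!written) {
            throw new IOException("No TIFF writer is registered with ImageIO");
        }
    }
}
```

With this threshold, pure red (luma about 76) maps to black and pure yellow (luma about 225) maps to white, so the text should come out as black on a white page. Whether that is enough for tesseract to read the result is something I have not verified.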

