Hi again,

After installing tesseract, I ran it against TIFF files.
Unfortunately, no text is being recognized.

The TIFF files were produced by converting PNG images (yellow
background, red font) in two ways:
1) with Java ImageIO
boolean b = ImageIO.write(image, "tiff", fileName);

- running tesseract against this type of file produces an empty
output file

2) with the Image Converter .EXE utility on Windows

- tesseract churns out the following text:
\\\\\\\\\\\\\\\\\\\\\HHHHHHHHHHHH\\\\\\\\\\\\\\\\\UU\\\\\\\\\\\\\\\H\W

By contrast, feeding tesseract the eurotext.tif sample file produces
perfect output.

Could anyone suggest possible reasons for the failure? Maybe the
background and text blend together, or special care should be taken
when converting PNGs into TIFFs?
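
One way to test that guess would be to reduce the image to plain black
text on a white background before writing the TIFF. Below is a minimal
sketch, not a confirmed fix. It assumes a Java 9+ runtime, where ImageIO
ships a TIFF writer; the class name PngToBilevelTiff, the helper
binarizeOnGreen, and the threshold of 128 are made up for illustration.
It thresholds on the green channel only, since that is where pure red
and pure yellow differ most.

```java
import java.awt.image.BufferedImage;
import java.io.File;
import java.io.IOException;
import javax.imageio.ImageIO;

public class PngToBilevelTiff {

    // Hypothetical helper: pixels whose green value is at or above the
    // threshold become white, all others become black.
    static BufferedImage binarizeOnGreen(BufferedImage src, int threshold) {
        BufferedImage out = new BufferedImage(
                src.getWidth(), src.getHeight(), BufferedImage.TYPE_BYTE_BINARY);
        for (int y = 0; y < src.getHeight(); y++) {
            for (int x = 0; x < src.getWidth(); x++) {
                int green = (src.getRGB(x, y) >> 8) & 0xFF;
                out.setRGB(x, y, green >= threshold ? 0xFFFFFFFF : 0xFF000000);
            }
        }
        return out;
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.err.println("usage: PngToBilevelTiff <input.png> <output.tif>");
            return;
        }
        // ImageIO.read returns null when no registered reader can decode the file.
        BufferedImage png = ImageIO.read(new File(args[0]));
        if (png == null) {
            System.err.println("Could not decode " + args[0]);
            return;
        }
        BufferedImage bilevel = binarizeOnGreen(png, 128);
        // ImageIO.write returns false when no writer is registered for the
        // format, so the result is worth checking rather than ignoring.
        boolean written = ImageIO.write(bilevel, "tiff", new File(args[1]));
        if (!written) {
            System.err.println("No TIFF writer available in this Java runtime");
        }
    }
}
```

If the resulting file is recognized, the color combination was the
likely culprit; if not, the problem is probably elsewhere (resolution,
font size, or the TIFF encoding itself).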

Thanks