His,

thanks for your message, I really should have scrutinized the documentation
first.
I tried scaling the image to 1779x100px and set resolution to 92dpi (all in
gimp)

Tesseract produces then:
HHhhhhhhhhhhhhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVHHHHHHhhhhhhhhhhhhVhVhVhVhVhVHHHHHH$lll
444l<W<<il4 IUJWWIIIIIIIIWNUWWWlUJ <|1U

Could you please advise if I am heading in the right direction by trying to
scale image to get a meaningful text out even though the original is ocr
averse?

thanks

On Fri, May 29, 2009 at 7:06 PM, Ray Smith <[email protected]> wrote:

> RTFM. See the FAQ on small text.Ray.
>
>
> On Tue, May 19, 2009 at 1:33 PM, denis56 <[email protected]>wrote:
>
>>
>> Here is the link to three files that I mentioned (original, converted
>> with java imageio package, and with Image Converted utility)
>> http://www.speedyshare.com/732780799.html
>>
>> Thanks
>>
>> On 19 Mai, 16:27, denis56 <[email protected]> wrote:
>> > His again,
>> >
>> > after having installed tesseract, I ran it against tif files.
>> > Unfortunately text is not being recognized.
>> >
>> > The tiff files were produced by converting a png images (yellow
>> > background, red font)
>> > 1) with java ImageIO
>> > boolean b = ImageIO.write(image, "tiff", fileName);
>> >
>> > - when running tesseract against this type an empty file will be
>> > outputted
>> >
>> > 2) with Image Converter .EXE utility on Windows
>> >
>> > - tesseract churns out following text
>> > \\\\\\\\\\\\\\\\\\\\\HHHHHHHHHHHH\\\\\\\\\\\\\\\\\UU\\\\\\\\\\\\\\\H\W
>> >
>> > While feeding tesseract with eurotext.tif sample file produces perfect
>> > output.
>> >
>> > Could anyone suggest possible reasons for failure. Maybe background
>> > and text flow together, special care should be taken by converting png
>> > into tiffs?
>> >
>> > Thanks
>>
>>
>
> >
>

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to