Thanks for the help. After making the image bigger and then thresholding it, I got much better results, but the text is still not read completely. The file was not scanned; it was downloaded from the web. Changing the format didn't make a difference.
Can you think of any other way to improve the results?

On Dec 21, 10:33 am, SteveP <[email protected]> wrote:
> Here are some things to try to get better results:
> 1) resize the image larger so characters such as 'e' are at least 20
> to 30 pixels high.
> 2) threshold to remove noise (make gray values above 130 or so all
> get mapped to 255).
> 3) unsure what tesseract does with bullets; does anyone else know?
> 4) If this is a scanned image, rescan at 300 dpi.
> 5) I vaguely remember JPEG is not the preferred format; png, bmp, tiff
> are better with tesseract if I remember correctly.
>
> See some of my other posts for additional details. Or search other
> posts in this group.
>
> On Dec 20, 8:45 pm, tomlei <[email protected]> wrote:
>
> > I just installed tesseract for OCR usage and on the first attempt
> > it failed to give me the right text (most of the words were weird
> > characters).
>
> > The pic is:
> > http://www.rentingtime.com/uploads/listing/l0033/0000033158/or48255.jpg
>
> > I ran it through some free online OCR websites and they can read it.
>
> > Can anybody explain what I am doing wrong or how to improve tesseract?
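
For anyone following along, the resize and threshold steps (tips 1 and 2 in the quoted advice) can be sketched roughly as below. This is only an illustration, not what tesseract or any particular imaging library does internally. It assumes the image is already loaded as a grayscale list of rows with values 0 to 255. The function names (`scale_factor_for`, `upscale`, `threshold`) are made up for this example, the target height of 25 is just the midpoint of the suggested 20 to 30 pixels, and the cutoff of 130 comes from the tip. In practice an imaging library would do the resizing with better interpolation.

```python
def scale_factor_for(char_height, target_height=25):
    """Smallest whole-number factor that makes a character of
    `char_height` pixels at least `target_height` pixels tall.
    (target_height=25 is an assumed midpoint of the 20-30 range.)"""
    return max(1, -(-target_height // char_height))  # ceiling division


def upscale(pixels, factor):
    """Nearest-neighbour enlargement: repeat every pixel `factor`
    times horizontally and every row `factor` times vertically."""
    out = []
    for row in pixels:
        wide = [value for value in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def threshold(pixels, cutoff=130):
    """Map gray values above `cutoff` to 255 (white), as in tip 2.
    Darker values are left unchanged."""
    return [[255 if value > cutoff else value for value in row]
            for row in pixels]


# Tiny 2x2 "image" whose characters are assumed to be 8 pixels high.
tiny = [[90, 140], [200, 60]]
prepared = threshold(upscale(tiny, scale_factor_for(char_height=8)))
print(len(prepared), len(prepared[0]))  # 8 8
```

The order here (enlarge first, then threshold) matches what was tried above. Whether 130 is the right cutoff depends on the image, so it is worth trying a few values.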

