On Sat, Mar 26, 2011 at 2:34 PM, rpjday <[email protected]> wrote:
> long story short, i'm seeing this issue on my ubuntu 10.10 system: > > http://ubuntuforums.org/showthread.php?t=1599686 > > the packages i have installed: > > * tessearct-ocr > * tesseract-ocr-eng > > which version you installed? > i took a simple screenshot of some text, saved it to a .tif file, then > ran: > > screenshot has usually very low DPI (96?). Suggested DPI for OCR is 300. Have a look on VietOCR (http://sourceforge.net/projects/vietocr/) there is also "screenshot" mode that try to solve this problem (yes it work also for other than Vietnamese language :-) ). > $ tesseract tess.tif tess > > which generated the output file tess.txt, whose content was a single > byte (the newline character). > > i added the option "-l eng", but that made no difference, and I get > no diagnostics. i also checked this out: > > https://help.ubuntu.com/community/OCR > > but i didn't see anything that would resolve this issue. can someone > verify that tesseract can properly process a trivial .tif file on > ubuntu? i just want a working example i can use as a starting point. > thanks. > > rday > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]. > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en. > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

