Update: I installed it and ran it on an Ubuntu 12 box, but I got the same result. So it doesn't appear that anything is wrong with my environment. Is something wrong with my default settings? My image? Doesn't Google use tesseract to do OCR in Google Docs? I uploaded the same image to google docs and it translated it perfectly. I'm not sure what I'm doing wrong.
On Wednesday, April 2, 2014 2:23:20 PM UTC-4, Joel Wheeler wrote: > > Hello- I've downloaded and compiled the source for Tesseract 3.02.02 and > installed the English learning files. To test the installation I am using > what I believe to be a pristine input image which was just a screenshot of > some tesseract home page text- so there is no markings or anything like > that which may occur from a scanned text image. I fed this image into > tesseract but the output is mostly garbled and the conversion very poor. A > couple of words were correctly translated but the rest is not close. I have > a feeling that something may be off in my environment or something and > someone might see the output and know immediately what my issue is. > > I'm running on a 64 bit Linux RedHat 6 install. > > Initially I built and installed tesseract 3.03 but had the same result. So > I ran 'sudo make uninstall' and then compiled and installed 3.02 instead > but there was very little difference in the output. > > I confirmed that english language was installed with: > > > tesseract --list-langs > List of available languages (1): > eng > > I've attached the input image as well as the file containing the output > text. > > Is this output what I should expect, or is something definitely off here? > Any assistance that folks could give would be greatly appreciated! > > Thank you. > -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.

