The image is inverted. Use a free image program or library to make it black on white. ImageMagick is a popular choice. Tesseract OCR does not handle white on black well. --Sven
On Wed, Sep 12, 2012 at 10:55 AM, Mike <[email protected]> wrote: > Hi me again, > > I attached another image which is quite similar, but in this case tesseract > fails to detect the correct output in any way, so not even using the > tesseract exe, with psm set to 8 or leaving it as auto > Has anybody got an idea why? Does this mean one has to teach tesseract to > read this character correctly? > > Thanks, > Mike > > > On Monday, September 3, 2012 11:06:41 AM UTC+2, Mike wrote: >> >> Hi, >> >> maybe someone can point me into the right direction. >> I use Windows 7 32 bit. >> When taking the attached image and loading it with tesseract.exe (3.01) >> via following command: tesseract.exe OCR_MONO_DEBUG.jpg test -l eng -psm 8 >> The result is correct. >> However I use the following functions (where image is the attached file >> read internally by my program converted to 1 byte mono): >> >> pTessBase->SetPageSegMode(tesseract::PSM_SINGLE_WORD); >> pTessBase->SetImage(pImage, width, height, 1, width); >> char* ocr_result = pTessBase->GetUTF8Text(); >> >> Then oddly enough I do not get any results, all I get is an empty string. >> Setting whitelist to only numbers does not help either. When I have 2 >> numbers to recognize such as 81 then all works fine. >> >> Thanks in advance. >> Mike > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en -- ``All that is gold does not glitter, not all those who wander are lost; the old that is strong does not wither, deep roots are not reached by the frost. >From the ashes a fire shall be woken, a light from the shadows shall spring; renewed shall be blade that was broken, the crownless again shall be king.” -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

