Hi, thanks for the hint, this improved my results. :).
On Wednesday, September 12, 2012 6:47:13 PM UTC+2, sventech wrote: > > The image is inverted. Use a free image program or library to make it > black on white. ImageMagick is a popular choice. Tesseract OCR does > not handle white on black well. > --Sven > > On Wed, Sep 12, 2012 at 10:55 AM, Mike <[email protected] <javascript:>> > wrote: > > Hi me again, > > > > I attached another image which is quite similar, but in this case > tesseract > > fails to detect the correct output in any way, so not even using the > > tesseract exe, with psm set to 8 or leaving it as auto > > Has anybody got an idea why? Does this mean one has to teach tesseract > to > > read this character correctly? > > > > Thanks, > > Mike > > > > > > On Monday, September 3, 2012 11:06:41 AM UTC+2, Mike wrote: > >> > >> Hi, > >> > >> maybe someone can point me into the right direction. > >> I use Windows 7 32 bit. > >> When taking the attached image and loading it with tesseract.exe (3.01) > >> via following command: tesseract.exe OCR_MONO_DEBUG.jpg test -l eng > -psm 8 > >> The result is correct. > >> However I use the following functions (where image is the attached file > >> read internally by my program converted to 1 byte mono): > >> > >> pTessBase->SetPageSegMode(tesseract::PSM_SINGLE_WORD); > >> pTessBase->SetImage(pImage, width, height, 1, width); > >> char* ocr_result = pTessBase->GetUTF8Text(); > >> > >> Then oddly enough I do not get any results, all I get is an empty > string. > >> Setting whitelist to only numbers does not help either. When I have 2 > >> numbers to recognize such as 81 then all works fine. > >> > >> Thanks in advance. > >> Mike > > > > -- > > You received this message because you are subscribed to the Google > > Groups "tesseract-ocr" group. > > To post to this group, send email to > > [email protected]<javascript:> > > To unsubscribe from this group, send email to > > [email protected] <javascript:> > > For more options, visit this group at > > http://groups.google.com/group/tesseract-ocr?hl=en > > > > -- > ``All that is gold does not glitter, > not all those who wander are lost; > the old that is strong does not wither, > deep roots are not reached by the frost. > From the ashes a fire shall be woken, > a light from the shadows shall spring; > renewed shall be blade that was broken, > the crownless again shall be king.” > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

