Tesseract does not work well for fewer than 4 chars, I think, and your image is very pixelated. Sven
On Friday, January 4, 2013, Mike wrote: > Hi, I am still facing an issue where the number 8 is not detected, > > Here is a way to reproduce the problem using binaries downloaded from the > tesseract site. > I downloaded the tesseract portable ( > http://code.google.com/p/tesseract-ocr/downloads/detail?name=tesseract-ocr-3.02-win32-portable.zip&can=2&q=) > and ran following command line with the attached image to this post. > tesseract.exe -l eng -psm 8 OCR_MONO_DEBUG.jpg test > in test.txt i get following string "/" > I would expect "8", I would really appreciate it a lot if anyone can > verify this behaviour on their side. > > Thanks in advance, > Mike > > On Thursday, September 13, 2012 12:19:13 PM UTC+2, Mike wrote: >> >> Hi, >> >> Thanks for the info. I am using revision 700, now I tried what "sventech" >> explained and it improved my results. I will integrate the latest revision >> and see if it then even gets better. >> >> On Wednesday, September 12, 2012 11:09:29 PM UTC+2, Stane wrote: >>> >>> Does the example images work with your code? >>> >>> If us the tesseract 3.02 api to detect your image(white 8 on black >>> ground), it get recognized without problems >>> Iam using the default PageSegMode and OEM_TESSERACT_ONLY. >>> Hope that helps somehow. >>> >>> On Monday, September 3, 2012 11:06:41 AM UTC+2, Mike wrote: >>>> >>>> Hi, >>>> >>>> maybe someone can point me into the right direction. >>>> I use Windows 7 32 bit. >>>> When taking the attached image and loading it with tesseract.exe (3.01) >>>> via following command: tesseract.exe OCR_MONO_DEBUG.jpg test -l eng -psm 8 >>>> The result is correct. >>>> However I use the following functions (where image is the attached file >>>> read internally by my program converted to 1 byte mono): >>>> >>>> pTessBase->SetPageSegMode(**tesseract::PSM_SINGLE_WORD); >>>> pTessBase->SetImage(pImage, width, height, 1, width); >>>> char* ocr_result = pTessBase->GetUTF8Text(); >>>> >>>> Then oddly enough I do not get any results, all I get is an empty >>>> string. Setting whitelist to only numbers does not help either. When I have >>>> 2 numbers to recognize such as 81 then all works fine. >>>> >>>> Thanks in advance. >>>> Mike >>>> >>> -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to > [email protected]<javascript:_e({}, 'cvml', > '[email protected]');> > To unsubscribe from this group, send email to > [email protected] <javascript:_e({}, 'cvml', > 'tesseract-ocr%[email protected]');> > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- ``All that is gold does not glitter, not all those who wander are lost; the old that is strong does not wither, deep roots are not reached by the frost. >From the ashes a fire shall be woken, a light from the shadows shall spring; renewed shall be blade that was broken, the crownless again shall be king.” -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

