Hi, I am still facing an issue where the number 8 is not detected,

Here is a way to reproduce the problem using binaries downloaded from the 
tesseract site.
I downloaded the tesseract portable 
(http://code.google.com/p/tesseract-ocr/downloads/detail?name=tesseract-ocr-3.02-win32-portable.zip&can=2&q=)
 
and ran following command line with the attached image to this post.
tesseract.exe -l eng -psm 8 OCR_MONO_DEBUG.jpg test
in test.txt i get following string  "/"
I would expect "8", I would really appreciate it a lot if anyone can verify 
this behaviour on their side.

Thanks in advance,
Mike

On Thursday, September 13, 2012 12:19:13 PM UTC+2, Mike wrote:
>
> Hi,
>
> Thanks for the info. I am using revision 700, now I tried what "sventech" 
> explained and it improved my results. I will integrate the latest revision 
> and see if it then even gets better.
>
> On Wednesday, September 12, 2012 11:09:29 PM UTC+2, Stane wrote:
>>
>> Does the example images work with your code?
>>
>> If us the tesseract 3.02 api to detect your image(white 8 on black 
>> ground), it get recognized without problems
>> Iam using the default PageSegMode and OEM_TESSERACT_ONLY.
>> Hope that helps somehow.
>>
>> On Monday, September 3, 2012 11:06:41 AM UTC+2, Mike wrote:
>>>
>>> Hi,
>>>
>>> maybe someone can point me into the right direction.
>>> I use Windows 7 32 bit.
>>> When taking the attached image and loading it with tesseract.exe (3.01) 
>>> via following command: tesseract.exe OCR_MONO_DEBUG.jpg test -l eng -psm 8
>>> The result is correct.
>>> However I use the following functions (where image is the attached file 
>>> read internally by my program converted to 1 byte mono):
>>>
>>> pTessBase->SetPageSegMode(tesseract::PSM_SINGLE_WORD);
>>> pTessBase->SetImage(pImage, width, height, 1, width);
>>> char* ocr_result = pTessBase->GetUTF8Text();
>>>
>>> Then oddly enough I do not get any results, all I get is an empty 
>>> string. Setting whitelist to only numbers does not help either. When I have 
>>> 2 numbers to recognize such as 81 then all works fine.
>>>
>>> Thanks in advance.
>>> Mike
>>>
>>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

<<attachment: OCR_MONO_DEBUG.jpg>>

Reply via email to