Hi,

thanks for the hint, inverting the image improved my results. :)
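
For the archives, here is a minimal sketch of the inversion step done in code instead of with an external image program. It assumes the same kind of buffer as in my first post (quoted below): 8-bit grayscale, one byte per pixel. InvertGray8 is a made-up helper name, not part of the Tesseract API, and the bytes_per_line parameter is only there in case rows are padded.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical helper (not part of Tesseract): inverts an 8-bit grayscale
// image in place, so white-on-black text becomes black-on-white.
// bytes_per_line may be larger than width if rows are padded; the padding
// bytes are left untouched.
void InvertGray8(std::uint8_t* pixels, int width, int height,
                 int bytes_per_line) {
    for (int y = 0; y < height; ++y) {
        std::uint8_t* row =
            pixels + static_cast<std::size_t>(y) * bytes_per_line;
        for (int x = 0; x < width; ++x) {
            row[x] = static_cast<std::uint8_t>(255 - row[x]);
        }
    }
}
```

In my case this would be called as InvertGray8(pImage, width, height, width) right before the SetImage call, assuming pImage points to unsigned 8-bit data.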

On Wednesday, September 12, 2012 6:47:13 PM UTC+2, sventech wrote:
>
> The image is inverted. Use a free image program or library to make it 
> black on white. ImageMagick is a popular choice. Tesseract OCR does 
> not handle white on black well. 
> --Sven 
>
> On Wed, Sep 12, 2012 at 10:55 AM, Mike <[email protected]> wrote:
> > Hi, me again,
> >
> > I attached another image which is quite similar, but in this case
> > tesseract fails to produce the correct output at all, not even when
> > using tesseract.exe with psm set to 8 or left on auto.
> > Has anybody got an idea why? Does this mean one has to teach tesseract
> > to read this character correctly?
> > 
> > Thanks, 
> > Mike 
> > 
> > 
> > On Monday, September 3, 2012 11:06:41 AM UTC+2, Mike wrote: 
> >> 
> >> Hi, 
> >> 
> >> Maybe someone can point me in the right direction.
> >> I use Windows 7 32-bit.
> >> When I take the attached image and load it with tesseract.exe (3.01)
> >> via the following command: tesseract.exe OCR_MONO_DEBUG.jpg test -l eng -psm 8
> >> the result is correct.
> >> However, I use the following functions (where the image is the attached
> >> file, read internally by my program and converted to 1-byte mono):
> >> 
> >> pTessBase->SetPageSegMode(tesseract::PSM_SINGLE_WORD); 
> >> pTessBase->SetImage(pImage, width, height, 1, width); 
> >> char* ocr_result = pTessBase->GetUTF8Text(); 
> >> 
> >> Then, oddly enough, I do not get any result; all I get is an empty
> >> string. Setting the whitelist to digits only does not help either.
> >> When I have two digits to recognize, such as 81, everything works fine.
> >> 
> >> Thanks in advance. 
> >> Mike 
> > 
> > -- 
> > You received this message because you are subscribed to the Google 
> > Groups "tesseract-ocr" group. 
> > To post to this group, send email to
> > [email protected]
> > To unsubscribe from this group, send email to
> > [email protected]
> > For more options, visit this group at 
> > http://groups.google.com/group/tesseract-ocr?hl=en 
>
>
>
> -- 
> "All that is gold does not glitter,
>   not all those who wander are lost;
> the old that is strong does not wither,
>   deep roots are not reached by the frost.
> From the ashes a fire shall be woken,
>   a light from the shadows shall spring;
> renewed shall be blade that was broken,
>   the crownless again shall be king."
>
