Given that your emails continue to be vague, here's my best suggestion.

I am not convinced your binarization is happening at the level that
tesseract requires. I would suggest looking at

a) a good conversion to grayscale
b) fg/bg separation using histograms of intensities
c) binarization

then run tesseract and see what you get. For better help, please post the
intermediate results and their outputs, NOT your (vague!!!) interpretations
of them.
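
As a rough illustration of steps a) to c), here is a minimal pure-Python
sketch. It is an assumption about what a reasonable pipeline could look like,
not what tesseract does internally: the grayscale weights are the common
ITU-R BT.601 luma weights, the fg/bg split uses Otsu's method on the intensity
histogram, and the function names and the dark_text flag are made up for this
example. Pixels are plain nested lists of (r, g, b) tuples so nothing beyond
the standard library is needed.

```python
def to_gray(rgb_rows):
    """Step a): convert rows of (r, g, b) tuples to 0-255 luminance values."""
    return [
        [int(round(0.299 * r + 0.587 * g + 0.114 * b)) for (r, g, b) in row]
        for row in rgb_rows
    ]


def histogram(gray_rows):
    """Count how many pixels have each intensity 0..255."""
    hist = [0] * 256
    for row in gray_rows:
        for value in row:
            hist[value] += 1
    return hist


def otsu_threshold(hist):
    """Step b): pick the threshold t that maximizes between-class variance.

    Pixels with intensity <= t form one class, the rest form the other.
    """
    total = sum(hist)
    sum_all = sum(i * count for i, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels at or below the candidate threshold
    sum0 = 0    # sum of intensities at or below the candidate threshold
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue            # no pixels in the lower class yet
        w1 = total - w0
        if w1 == 0:
            break               # no pixels left in the upper class
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        between_var = w0 * w1 * (mean0 - mean1) ** 2
        if between_var > best_var:
            best_var, best_t = between_var, t
    return best_t


def binarize(gray_rows, t, dark_text=True):
    """Step c): map text pixels to 0 (black) and background to 255 (white).

    dark_text is a hypothetical flag for this sketch: set it to False when
    the digits are brighter than the background (e.g. a lit display).
    """
    if dark_text:
        return [[0 if v <= t else 255 for v in row] for row in gray_rows]
    return [[0 if v > t else 255 for v in row] for row in gray_rows]
```

Saving the output of each step as its own image, and posting those along with
the histogram and the chosen threshold, gives exactly the kind of intermediate
result that makes the problem diagnosable.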

M

On Tue, Apr 17, 2012 at 1:48 AM, AMetnik <[email protected]> wrote:

> Since I'm not allowed to update my post under a similar title, I will restart
> it here:
>
> I was hoping someone could tell me why my Tesseract has trouble
> recognizing some images with digits, and whether there is something I can do
> about it.
> Everything is working according to testing, and since I only need digits, I
> thought I could manage with the English pattern until I had to
> start on the 7-segment display as well.
>
> I am having a lot of trouble with the attached images, though. I'd like to
> know whether I should start working on my own recognition algorithms, or
> whether I could build my own datasets for Tesseract and then it would work.
> Does anyone know where the limitations of Tesseract lie?
>
> Things tried:
> I tried setting psm to one_line, one_word and one_char (chopping up the
> picture).
> With one_line and one_word there was no significant change.
> With one_char it recognized a bit better, but sometimes, due to the wide
> spacing, it attached an extra digit, which then ruined the result; for the
> attached image zero.jpg, the result was 04.
> I have also tried doing the binarization myself; this resulted in poorer
> recognition and was very resource-consuming.
> I have tried inverting the pictures; this makes no difference at all to
> tesseract.
>
> I have attached some of the pictures I need processed.
>
> Explanation of the images:
> decodethisimage_seven is an image that tesseract has no trouble
> recognizing, though it was made in Word for the convenience of
> building an app around a working image.
> decodethisimage_eight is a real-life image matching image_seven, but
> tesseract cannot recognize it.
> decodethisimage_four2 is another image I'd like it to recognize, and yes, I
> know it can't be skewed; I did deskew (straighten) it when testing.
>
>  --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>



-- 

URL:
www.cse.msu.edu/~mudigon1
www.blindsight.com/team
Elegance is not a dispensable luxury but a factor that decides between
success and failure.
Edsger Dijkstra
