You will need to perform some kind of pre-processing before sending it to 
Tesseract. 

For instance, if you always knew the ticket was a certain size and the 
image was always straight, you could first crop out the rectangular areas 
for each row (I'm assuming you are looking for row numbers here but the 
same applies to other areas).

Once you have done that, I ran Tesseract on the first row ...

1.8 27 29 3(2) 37 50

Nearly, but not quite.

I then created a thresholded image of the cropped image that made the black 
lines bolder.

18 27 29 30 37 50

Perfect.

Cheers

On Thursday, 13 November 2014 02:51:07 UTC, Danilo Tuler wrote:
>
> Hi,
>
> I'm trying to scan the attached lottery tickets. (not winning tickets 
> unfortunately :-)
> Those scans are grayscale 300dpi TIFF's.
>
> I tried with the standard english language with little success.
> Then I tried to create a new language, with one or two fonts, and train it.
> The results were even worse.
>
> What do you think?
>
> Thanks,
> Danilo
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/abd5987b-6321-4fcf-8dad-ef30d5cff2d3%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to