Re: California License Plate font issues with OCR

Andres Wed, 28 Jul 2010 21:24:50 -0700

Sorry, when I wrote this:

- make an image with my font (one of each from the alphabet), and repeating
the alphabet with different levels of threshold. I think that internally
Tesseract thresholds the images. Hard to explain this, but I think that it
may improve the quality.


I forgot to clarify that my intention is to train Tesseract with that image.
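
To make the idea a bit more concrete, here is a minimal sketch in plain Python of what I mean by "different levels of threshold". It is only an illustration: the glyph data, the function names, and the three threshold levels are made up, and I have not verified that training on such variants actually helps Tesseract.

```python
def threshold(gray, level):
    """Binarize a grayscale image (rows of 0-255 ints): 1 = ink (dark), 0 = background."""
    return [[1 if px < level else 0 for px in row] for row in gray]


def threshold_variants(gray, levels):
    """Return one binarized copy of the glyph per threshold level."""
    return {level: threshold(gray, level) for level in levels}


# A tiny hypothetical 5x3 glyph with blurred (mid-gray) edges.
glyph = [
    [255, 200,  90,  40, 255],
    [255, 120,  20, 130, 255],
    [255,  60,  30, 210, 255],
]

# Levels chosen arbitrarily for the example.
variants = threshold_variants(glyph, levels=(64, 128, 192))
for level, binary in variants.items():
    rows = ["".join("#" if b else "." for b in row) for row in binary]
    print(level, rows)
```

The same blurred glyph comes out thinner or thicker depending on the level, so the training image would contain each character in several stroke weights.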




2010/7/28 Andres <[email protected]>

> Hello,
>
> I'm working on the same thing as you, for licence plates from Argentina,
> where I live.
>
> As in your case, the problem was locating the licence plate.
>
> Now I'm working on the OCR, and next I will work on horizontalizing
> (deskewing) the images, because if they are not completely horizontal the
> OCR fails. For example, today I was getting a 5 instead of a 6. When I
> horizontalized the image with Photoshop, everything came out OK.
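
A rough sketch of the geometry behind horizontalizing, in plain Python. It assumes you already have two points on the plate's bottom edge (the coordinates below are hypothetical); rotating the actual pixels would be done with whatever image library you use, which is not shown here.

```python
import math


def skew_angle(left, right):
    """Angle in degrees between the line left->right and the horizontal axis."""
    return math.degrees(math.atan2(right[1] - left[1], right[0] - left[0]))


def rotate(point, center, degrees):
    """Rotate `point` around `center` using the standard 2D rotation matrix."""
    rad = math.radians(degrees)
    dx, dy = point[0] - center[0], point[1] - center[1]
    return (
        center[0] + dx * math.cos(rad) - dy * math.sin(rad),
        center[1] + dx * math.sin(rad) + dy * math.cos(rad),
    )


# Hypothetical bottom-left and bottom-right corners of a detected plate.
bottom_left, bottom_right = (10.0, 50.0), (110.0, 60.0)

angle = skew_angle(bottom_left, bottom_right)

# Rotating by the negative of the measured angle puts both corners
# on the same row again, i.e. the bottom edge becomes horizontal.
fixed_right = rotate(bottom_right, bottom_left, -angle)
print(round(angle, 2), fixed_right)
```
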
>
> I don't know the layout of letters and numbers on California plates. Are
> they mixed? If you know whether each character should be a number or a
> letter according to its position, you have two options (as far as I know):
>
> - when recognizing char by char, tell Tesseract that you expect a number or
> a letter. I saw that somewhere inside the source code, but I don't remember
> where.
> - make your own conversion, e.g., if you are expecting a number and you get
> a G, map it to a 6; if you are expecting a letter and you get a 2, map it
> to a Z.
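
The second option could look roughly like the sketch below. Only the G/6 and 2/Z pairs come from the message above; the other look-alike pairs, the function name, and the layout strings are assumptions for illustration and would have to match the real plate format.

```python
# Look-alike substitutions. Only G->6 and 2->Z are from the message;
# the remaining pairs are assumed for the example.
TO_DIGIT = {"G": "6", "O": "0", "I": "1", "Z": "2", "S": "5", "B": "8"}
TO_LETTER = {"2": "Z", "0": "O", "1": "I", "5": "S", "8": "B", "6": "G"}


def correct(text, layout):
    """Fix OCR output using a layout string: 'D' = digit expected, 'L' = letter expected."""
    if len(text) != len(layout):
        return text  # length mismatch: leave the raw OCR result untouched
    out = []
    for ch, kind in zip(text.upper(), layout):
        if kind == "D" and not ch.isdigit():
            ch = TO_DIGIT.get(ch, ch)   # a letter where a digit is expected
        elif kind == "L" and not ch.isalpha():
            ch = TO_LETTER.get(ch, ch)  # a digit where a letter is expected
        out.append(ch)
    return "".join(out)


# Hypothetical layouts: three letters then three digits, and
# digit + three letters + three digits.
print(correct("ABC1Z3", "LLLDDD"))
print(correct("GAB2123", "DLLLDDD"))
```

Characters with no entry in the tables are passed through unchanged, so the function never makes a result worse than the raw OCR output at an unknown character.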
>
> I think that I'll use the second one, though I'm not on that part yet. I'm
> getting good results on images where the characters are big because of the
> camera distance, but with small letters (13 pixels high) the results are
> not good.
>
> So I have a couple of ideas to test; perhaps somebody from the group could
> give me opinions on them:
> - following the contour, with polygon approximation of the chars, making an
> image from those contours and running Tesseract on that image (trained for
> that)
> - make an image with my font (one of each from the alphabet), and repeating
> the alphabet with different levels of threshold. I think that internally
> Tesseract thresholds the images. Hard to explain this, but I think that it
> may improve the quality.
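
For the first idea, one common way to do polygon approximation is the Ramer-Douglas-Peucker algorithm. The message does not say which method is meant, so treat this as one possible choice. A minimal plain-Python sketch, with a made-up contour and tolerance:

```python
import math


def _dist_to_line(p, a, b):
    """Perpendicular distance from point p to the line through a and b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    length = math.hypot(bx - ax, by - ay)
    if length == 0:
        # a and b coincide (e.g. a closed contour): use distance to the point.
        return math.hypot(px - ax, py - ay)
    return abs((bx - ax) * (ay - py) - (ax - px) * (by - ay)) / length


def simplify(points, epsilon):
    """Ramer-Douglas-Peucker: drop points within epsilon of the chord."""
    if len(points) < 3:
        return list(points)
    # Find the interior point farthest from the chord joining the endpoints.
    index, dmax = 0, 0.0
    for i in range(1, len(points) - 1):
        d = _dist_to_line(points[i], points[0], points[-1])
        if d > dmax:
            index, dmax = i, d
    if dmax <= epsilon:
        return [points[0], points[-1]]
    # Keep the farthest point and simplify both halves recursively.
    left = simplify(points[: index + 1], epsilon)
    right = simplify(points[index:], epsilon)
    return left[:-1] + right


# Hypothetical noisy contour of an "L"-shaped stroke.
contour = [(0, 0), (1, 0.1), (2, -0.1), (3, 0), (3, 1.1), (3, 2), (3, 3)]
print(simplify(contour, epsilon=0.5))
```

The simplified polygons would then be drawn into a clean image for Tesseract; the drawing and training steps are not shown here.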
>
> If you want to keep discussing the specifics of licence plate recognition,
> we can continue privately because it's off topic here. I'm interested in
> continuing. There are many things to talk about, for example camera
> prices, light filters, execution times, etc.
>
> You can write me to andrej100 at gmail
>
> Regards,
>
> Andres
>
>
>
> 2010/7/28 ZIA <[email protected]>
>
>> I am writing a license plate recognition application in C#. I am
>> almost done. I started work on my own OCR, but then I decided to
>> use tesseract-ocr, which now partially works. I feed California
>> license plates to the OCR, but it doesn't recognize some of the
>> characters: for example, "Z" becomes the number 2, the letter "O" becomes
>> "U", and the number 4 becomes something else. Any suggestions? Is there a
>> language file or font file that will solve this issue? Besides that, in
>> complex images I am having a hard time locating the license plate, but my
>> concern right now is the OCR, since I thought I would save time by using
>> Tesseract rather than writing my own neural network. I would really
>> appreciate any ideas or suggestions.
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To post to this group, send email to [email protected].
>> To unsubscribe from this group, send email to
>> [email protected]<tesseract-ocr%[email protected]>
>> .
>> For more options, visit this group at
>> http://groups.google.com/group/tesseract-ocr?hl=en.
>>
>>
>

