Hi,

I am working on a image processing assignment and I would like to use
Tesseract for recognising the letters/numbers on the plate after it
has been located.

I want to use Tesseract as OCR is hard and this library claims that it
can handle skewed and curved lines.

I seems to work reasonably well but I think I need to tweak the
settings.

So far I have told it to only look for A-Z and 0-9.

It seems to try to break the number plate up into words, when it
should be a single 'word'.

It also tries to stick words into all numbers or all letters rather
than letting it mix. e.g. 4536B becomes 45368.

So how do I get it to disable the word breaking, and disable the
dictionary/number classifier part.

Is it possible to tell it some sort of pattern to match? All the
number plates I need to recognise follow these patterns:
S[C or D][A to Z]xxxx[A to Z]
S[C or D][A to Z]xxx[A to Z]
S[C or D][A to Z]xx[A to Z]
EA to Z]xxxx[A to Z]

Some of the S... number plates are split into two lines:

S[C or D][A to Z]
xxxx[A to Z]

S[C or D][A to Z]
xxx[A to Z]

x = [0 to 9]

Thanks,
Leith

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

Reply via email to