Thanks a lot, Tom, for your in-depth answers. I have some follow-up queries inlined below in your reply.
Also, I tried testing some of the images with standalone Tesseract as the OCR recognizer and found that the results were on average better when the images do not demand any layout analysis (which is expected, I believe). I had come across discussion threads stating that Tesseract is not the default OCR engine for OCRopus and that the pluggable integration is still in the works. Are there any updates on this?

Thank you again for the help; for my part, I would like to share my work with the other members as soon as it yields demonstrable results.

Regards,
Amrit.

On Tue, Mar 1, 2011 at 4:16 AM, Tom <[email protected]> wrote:

>> I am trying to build a zip code recognizer for a set of postal images
>> using OCRopus. I have been using dict2linefst to create a custom
>> language model for the images. In doing so, I wanted to clarify the
>> following:
>>
>> 1. I have observed that in dict-costs each word is associated with a
>> corresponding weight which is used internally to build a WFST using
>> extended OpenFST. Are there any guidelines for assigning these weights
>> in the dictionary? At present I am constructing my LM from a
>> dictionary which contains elements such as:
>>
>> 0.0,
>> 0.0,City State Zipcode
>> 0.0,City State Zipcode
>
> The dic-costs file is just a sample input to the wdict2wordfst script.
> However, none of those scripts are really meant for production use. You
> probably need to write your own script using the PyOpenFST library or
> the OpenFST tools in order to build a working language model.

Thanks. I am in fact using my own script, with dict2linefst as an example; it has helped me eliminate all the unnecessary character output that I was seeing before.

>> 2. The default character model does not yield any result when used
>> with my custom LM. On the other hand, if I use 2m2-reject.cmodel
>> (found under ocropy/) I am able to decode the image by force-aligning
>> the output with the LM. Any inputs as to why this should be the case?
>
> The default is probably a line recognizer model; it may simply not
> recognize some of the characters at all, and the way it constructs the
> FST may leave the segmentation graph unconnected.
>
> If you use a character model like 2m2-reject.cmodel, it uses the new
> character recognizer (CmodelLineRecognizer), which tries harder to keep
> the segmentation graph connected.
>
> Note that 2m2-reject.cmodel, while fairly good, doesn't take into
> account character geometry. There's a new set of character models being
> trained that does.
>
> You can see better what's going on with ocropus-showlrecs.
>
> You can also look at the raw recognizer output with
> "fstdraw output.fst > output.dot; dotty output.dot" or something like
> that (have a look at the OpenFST documentation).
>
> Note that the preferred way of running OCRopus is *not* using
> ocropus-pages (that mainly serves to illustrate the different software
> components). Instead, run the sequence:
>
> ocropus-binarize
> ocropus-pseg
> ocropus-lattices
> ocropus-align
>
> (ocropus-lattices and ocropus-align replace ocropus-calign)

I will follow up as suggested; thank you for pointing out the correct run sequence as well. However, in my current installation (a roughly month-old hg clone of OCRopus 0.4.4) I do not see ocropus-lattices and ocropus-align, only ocropus-calign. I am presuming these changes have been included recently.
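For reference, this is roughly how I intend to run the new sequence once I update my checkout. The file names and argument conventions below are only my assumption from the tool names; I will verify them against each script's --help before relying on this:

    # Assumed invocation pattern; arguments are my guess, to be checked
    # against the actual ocropus-* scripts in the newer checkout.
    ocropus-binarize page.png        # binarize the scanned page image
    ocropus-pseg     page.bin.png    # segment the binarized page into text lines
    ocropus-lattices line-*.png      # produce per-line recognition lattices (FSTs)
    ocropus-align    line-*.fst      # align/decode the lattices against my zip code LM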
>> 3. Though I am able to obtain results by the above approach, the
>> accuracy is poor (because of the constrained LM, wrong City/Zip code
>> combinations are dumped as results). Do I need to train a cmodel on
>> this set of images using ocropus-calign, or is there a way of
>> optimizing it without training?
>
> You probably need to train a new cmodel.
>
>> 4. Also, is there a way of assigning weights to the character model
>> (recognizer) and the LM in OCRopus, for example as done on the
>> Automatic Speech Recognition side with different weights for acoustic
>> models and language models?
>
> You can compile different weights into the language model, which
> amounts to the same thing. There is some code to do that at recognition
> time, but currently no command line option (there will be at some point
> to make recognition easier).
>
>> 5. Training with ocropus-calign fails. If I try to use ocropus-calign
>> as per the instructions at
>> http://groups.google.com/group/ocropus/browse_thread/thread/4f3a2ee1a..
>> it always fails to find the ground truth file (.gt.txt); any
>> suggestions? (I am trying to build on 2m2-reject.cmodel as the
>> character model.)
>
> Without more info, I can't tell what the problem is; but if it fails to
> find the ground truth file, it's probably in the wrong place. You can
> check with "strace -eopen ..." what files it is actually trying to open.
>
> Note that ocropus-calign has been replaced by ocropus-lattices +
> ocropus-align, and the options have changed.

My apologies, I should have explicitly stated the error encountered at my end. Below is the logged output that I see:

    $ ocropus-calign -x .gt.txt -m 2m2-reject.cmodel 010004.png
    loading ../../models/2m2-reject.cmodel
    *** ('010004.png', None)
    [[[ load 010004.png
    lraw 0.00 12 s-u\v,|ok9aa
    gt 010004.gt.txt
    ERROR 010004.gt.txt failed to load
    amrit@amrit:/media/Data/OCR/images/images/IMAGE-Results/TEST$ cat 010004.gt.txt
    SOUTHBURY, CT 06488

This is just one example where I am using a single image: it starts decoding and then fails to read the ground truth file, which is referenced by the .gt.txt extension. The same error occurs for bulk processing as well. 010004.png contains only the stated ground truth as an image. Do let me know in case my usage is at fault.

As suggested, I will also try this with the revamped ocropus-lattices + ocropus-align implementation. Are there any changes to the steps for training a cmodel compared to the ones described for ocropus-calign?

>> 6. Any other suggestions which might prove effective in implementing
>> such a zip code recognizer using OCRopus, as per your experience?
>
> Recognition rates should be very good with a language model; I've built
> those kinds of models for handwriting recognition.
>
> However, you probably should build the model yourself, rather than
> using the Python scripts in pyopenfst (see above).
>
> Tom
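Following your suggestion to build the language model myself with the OpenFST tools rather than the sample scripts, below is a minimal sketch of what I have in mind for a small word-level acceptor over City/State/Zip lines. The file names, symbols, and weights are placeholders of mine (weights meant as costs, e.g. negative log probabilities, so more likely lines get smaller values), and I realize the OCRopus decoder may expect character-level labels rather than word-level ones; please correct me if the structure is off:

    # words.syms: symbol table mapping each token to an integer id (0 = epsilon)
    cat > words.syms <<'EOF'
    <eps>       0
    SOUTHBURY,  1
    CT          2
    06488       3
    EOF

    # lm.txt: OpenFST text format, one arc per line
    #   src dest input-symbol output-symbol [cost]
    # plus a final-state line "state [cost]".
    cat > lm.txt <<'EOF'
    0 1 SOUTHBURY, SOUTHBURY, 0.0
    1 2 CT CT 0.0
    2 3 06488 06488 0.0
    3
    EOF

    # compile the text description into a binary FST and sanity-check it
    fstcompile --isymbols=words.syms --osymbols=words.syms \
               --keep_isymbols --keep_osymbols lm.txt lm.fst
    fstinfo lm.fst
    fstdraw lm.fst | dot -Tpng > lm.png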
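On question 4, my understanding of your answer is that, since the composed cost is just the sum of the recognizer lattice cost and the LM cost, the ASR-style balance can be obtained by multiplying every cost compiled into the language model by a scale factor. As a workaround until a command line option exists, I was planning to rescale an already-built LM by round-tripping it through the text format (again, this is my own assumption, not an OCRopus feature):

    # Multiply all arc costs (5th field of fstprint output) and final-state
    # costs (2nd field) of lm.fst by LMSCALE; lines with an implicit zero
    # cost print fewer fields and are left untouched, which is still correct.
    LMSCALE=2.0
    fstprint lm.fst \
      | awk -v s="$LMSCALE" 'NF==5 {$5 = $5 * s} NF==2 {$2 = $2 * s} {print}' \
      | fstcompile --isymbols=words.syms --osymbols=words.syms \
                   --keep_isymbols --keep_osymbols > lm-scaled.fst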
