Re: Utility functions

Pierpaolo Monaco Tue, 12 May 2009 02:56:02 -0700

I have downloaded and installed the last release of ocropus from the
mercurial repositories.
Everything works pretty well.


Now I need to work with the language model so I have download and installed
the last version of openfst (v. 1.1) following the installation procedure
from the official website.

During the compiling and installation process I didn't have any problem but
when i test the shell commands i get this error.

overco...@overcomer-laptop:~$ fstinfo
> /usr/local/share/ocropus/models/default.fst
> fstinfo: error while loading shared libraries: libfst.so.0: cannot open
> shared object file: No such file or directory
>

I get this error from any shell command.

I have this file installed in many directories

overco...@overcomer-laptop:~$ locate libfst.so.0
> /home/overcomer/openfst-1.1/src/lib/.libs/libfst.so.0
> /home/overcomer/openfst-1.1/src/lib/.libs/libfst.so.0.0.0
> /usr/lib/gnome-vfs-2.0/modules/libfst.so.0
> /usr/local/lib/libfst.so.0
> /usr/local/lib/libfst.so.0.0.0
>

Because you are experienced in openfst, could somebody tell me what's the
cause?

thanks.

2009/4/15 Thomas Breuel <[email protected]>

> On Tue, Apr 14, 2009 at 05:04, overcomer <[email protected]>wrote:
>
>> Although the image is pretty clear, the text is really "readable", if
>> I process the image with tesseract the result is disappointing.
>> Tesseract recognizes something like the 15 percent of the characters.
>> I think that is depend because of a not-correct use of the dictionary.
>> Now, i can improve this result because the strings are related between
>> them, and some of them are for example just a name of a person or of a
>> city, so with a limited output.
>> What I need to know, if there is some function that analizes the
>> character and return a value that represent the probabilities of the
>> character to be that one, or another one.
>
>
> Yes; OCRopus 0.3 has the ocr-bpnet classifier; OCRopus 0.4 replaces that
> with a new classifier.  Both the old and the new classifiers output
> posterior probabilities.
>
> In this way when I will rebuild the string, i can use just this
>> probabilities and other implicit informations of document to improve
>> my results.
>
>
> Yes, not only can you do that, OCRopus supports that directly through its
> use of statistical language models.  That is, you can define a statistical
> language model that says something like:
>
> 5.1% London
> 4.9% Paris
> 4.7% New York
> 4.3% Berlin
> ...
>
> If you give OCRopus the input string containing just a city name, you run
> it through its recognizer, and then you apply the statistical language
> model, it will give you the most probable interpretation of the input image.
>
> In 0.3, this process is still a little obscure, in 0.4, you will be able to
> run it directly from the command line.
>
> Tom
>
>
>
> >
>


-- 
-----------------------------
Pierpaolo Monaco
----------------------------

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"ocropus" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/ocropus?hl=en
-~----------~----~----~----~------~----~------~--~---

Re: Utility functions

Reply via email to