> I've managed to get OCRopus to output lattices in the latest development
> HG checkout, using the ocropy command "ocropus-linerec" (which seems to
> output them by default).
> However, as far as I can tell, the binary OpenFST files do not contain
> embedded symbol tables.
>
> I.e. if I convert the binary FST to a text FST, I get something like this:
> 0       1       65537   33      16.7413559
> 0       24      65537   33      11.7413559
> 0       1       65537   49      17.3978558
> 0       25      65537   49      12.3978558

That's correct.  We don't use FST symbol tables because they aren't
supported by the other FST libraries that we're using.  Instead, we
just use Unicode codepoints, plus a special code for ligatures.  We
may revisit that.

Just read the integer output label of each arc (the fourth column in
your text dump) and convert it to a Unicode character.  Anything of
the form 0x01?????? is a ligature.
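
As a rough illustration, here is a minimal Python sketch of that
conversion, working on text-format arcs like the ones quoted above.
The function names are made up for this example, and it assumes that
"0x01??????" means a 32-bit value whose top byte is 0x01.  It only
flags ligature codes and does not try to expand them, since the
ligature encoding itself isn't described here.

```python
# Sketch only: helper names are hypothetical, not part of OCRopus or OpenFST.
LIGATURE_MASK = 0xFF000000  # assumption: the top byte marks a ligature
LIGATURE_TAG = 0x01000000   # assumption: "0x01??????" read as 8 hex digits
MAX_CODEPOINT = 0x10FFFF    # largest valid Unicode codepoint


def is_ligature(label):
    """Return True if the label has the assumed 0x01?????? ligature form."""
    return (label & LIGATURE_MASK) == LIGATURE_TAG


def decode_label(label):
    """Convert an integer output label to a printable string."""
    if label == 0:
        return ""  # label 0 is epsilon by OpenFST convention
    if is_ligature(label):
        return "<ligature 0x%08X>" % label  # flagged, not expanded
    if label > MAX_CODEPOINT:
        return "<unknown 0x%X>" % label
    return chr(label)


def parse_arc(line):
    """Parse one text-FST arc line: src dst ilabel olabel [weight].

    Returns None for lines with fewer than four fields (e.g. final states).
    """
    fields = line.split()
    if len(fields) < 4:
        return None
    src, dst, ilabel, olabel = (int(f) for f in fields[:4])
    weight = float(fields[4]) if len(fields) > 4 else 0.0
    return src, dst, ilabel, olabel, weight


if __name__ == "__main__":
    arc = parse_arc("0       1       65537   33      16.7413559")
    print(arc, repr(decode_label(arc[3])))
```

With the arcs quoted above, the output labels 33 and 49 decode to the
characters "!" and "1".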

Tom
