> I've managed to get OCRopus to output lattices in the latest development > HG-checkout, with the ocropy command "ocropus-linerec" (which it seems to > output by default). > However, as far as I can tell the binary OpenFST files do not contain > embedded symbol tables. > > I.e. if I convert the binary FST to a text FST, I get something like this: > 0 1 65537 33 16.7413559 > 0 24 65537 33 11.7413559 > 0 1 65537 49 17.3978558 > 0 25 65537 49 12.3978558
That's correct. We don't use FST symbol tables because they aren't supported by the other FST libraries that we're using. Instead, we just use Unicode codepoints, plus a special code for ligatures. We may revisit that. Just access the integer output values and convert to Unicode. Anything of the form 0x01?????? is a ligature. Tom -- You received this message because you are subscribed to the Google Groups "ocropus" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/ocropus?hl=en.
