Thanks, so I ran this (f-uk-UA.dict is file with frequencies): java -jar ./mfl/morfologik-tools-*-standalone.jar fsa_dump -x -d f-uk-UA.dict | wc -l 1013148
while (uk-UA.dict is old one without frequencies) java -jar ./mfl/morfologik-tools-*-standalone.jar fsa_dump -d uk-UA.dict | wc -l 1623646 is this still correct? Thanks Andriy 2014-12-22 2:41 GMT-05:00 Dawid Weiss <dawid.we...@gmail.com>: >> So it looks like I get less words in the output, or am I reading it wrong? > > You are reading it wrong, Andriy. A final state does not correspond to > a unique path through the automaton. > > AX > AY > > should have two final states (X, Y), whereas: > > AX1 > AY1 > > will only have one final state (1). > > All this is a bit more complicated by the fact that morfologik uses > final bits on arcs rather than actual states... I'd say don't go into > it unless you really have to. You can always dump the content of the > created automaton, it should just return the input used to create it > -- that's how you know the automaton is valid. > > Dawid > > ------------------------------------------------------------------------------ > Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server > from Actuate! Instantly Supercharge Your Business Reports and Dashboards > with Interactivity, Sharing, Native Excel Exports, App Integration & more > Get technology previously reserved for billion-dollar corporations, FREE > http://pubads.g.doubleclick.net/gampad/clk?id=164703151&iu=/4140/ostg.clktrk > _______________________________________________ > Languagetool-devel mailing list > Languagetool-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/languagetool-devel ------------------------------------------------------------------------------ _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel