Thanks, so I ran this (f-uk-UA.dict is file with frequencies):
java -jar ./mfl/morfologik-tools-*-standalone.jar fsa_dump -x -d
f-uk-UA.dict | wc -l
1013148

while (uk-UA.dict is old one without frequencies)
java -jar ./mfl/morfologik-tools-*-standalone.jar fsa_dump -d uk-UA.dict | wc -l
1623646

is this still correct?

Thanks
Andriy

2014-12-22 2:41 GMT-05:00 Dawid Weiss <dawid.we...@gmail.com>:
>> So it looks like I get less words in the output, or am I reading it wrong?
>
> You are reading it wrong, Andriy. A final state does not correspond to
> a unique path through the automaton.
>
> AX
> AY
>
> should have two final states (X, Y), whereas:
>
> AX1
> AY1
>
> will only have one final state (1).
>
> All this is a bit more complicated by the fact that morfologik uses
> final bits on arcs rather than actual states... I'd say don't go into
> it unless you really have to. You can always dump the content of the
> created automaton, it should just return the input used to create it
> -- that's how you know the automaton is valid.
>
> Dawid
>
> ------------------------------------------------------------------------------
> Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
> from Actuate! Instantly Supercharge Your Business Reports and Dashboards
> with Interactivity, Sharing, Native Excel Exports, App Integration & more
> Get technology previously reserved for billion-dollar corporations, FREE
> http://pubads.g.doubleclick.net/gampad/clk?id=164703151&iu=/4140/ostg.clktrk
> _______________________________________________
> Languagetool-devel mailing list
> Languagetool-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/languagetool-devel

------------------------------------------------------------------------------
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to