I was doing a bit of profiling of LT using jvisualvm from jdk8 - I run
Ukrainian rules test on huge magazine archives, ~30min run total and
was checking if I can speed it up a bit. And although I was able to
optimize UkrainianWordTokenizer a bit it looked like most time is
spent in FSA5.getNextArc() and FSA5.getArc() (even more time than in
paralellized performCheck() itself - I have 4 CPUs) - see attached
screenshot.

Would anybody know if this this something that's specific to my tests,
or is this something we can optimize, or is it too hard to optimize at
this level?

Thanks
Andriy
------------------------------------------------------------------------------
Want excitement?
Manually upgrade your production database.
When you want reliability, choose Perforce
Perforce version control. Predictably reliable.
http://pubads.g.doubleclick.net/gampad/clk?id=157508191&iu=/4140/ostg.clktrk
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to