Hi all,

just for your information, I discovered one small bug in the encoding
routines in morfologik-stemming. We already have a fix and after more
testing, we're going to release a new version. The bug is for synthesizers
with really long POS tags on long words: the inflected form was not encoded
properly.

Regards,
Marcin


2013/10/16 Daniel Naber <[email protected]>

> On 2013-10-04 19:28, Dominique Pellé wrote:
>
> > Why aren't the scripts in of all binary dictionaries in Git?
>
> There are now two small programs for exporting a dictionary and creating
> one:
>
> org.languagetool.dev.DictionaryExporter
> org.languagetool.dev.POSDictionaryBuilder
>
> When you are using the stand-alone version (starting with tomorrow's
> snapshot), they can be called like this to get a usage message:
>
> java -cp languagetool-standalone.jar
> org.languagetool.dev.DictionaryExporter
> java -cp languagetool-standalone.jar
> org.languagetool.dev.BuildPOSDictionary
>
> Building synthesizer and hunspell dictionaries is not yet supported. The
> advantage this approach has over shell scripts is that this way the
> dependency we need (morfologik-tools) can be fetched via Maven, like
> anything else.
>
> Now the problem is that some of the generated dictionaries don't work,
> i.e. they make the tests fail:
> Catalan, French, Italian, Romanian, Spanish.
> Could someone who knows one of these languages have a look and see what
> the problem might be? What I did was this: export the dictionary, then
> create a new dictionary using that export, copying it over the existing
> dictionary (there's actually a test for that: POSDictionaryBuilderTest).
>
> Regards
>   Daniel
>
> --
> http://www.danielnaber.de
>
>
>
> ------------------------------------------------------------------------------
> October Webinars: Code for Performance
> Free Intel webinars can help you accelerate application performance.
> Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most
> from
> the latest Intel processors and coprocessors. See abstracts and register >
> http://pubads.g.doubleclick.net/gampad/clk?id=60135031&iu=/4140/ostg.clktrk
> _______________________________________________
> Languagetool-devel mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/languagetool-devel
>
------------------------------------------------------------------------------
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register >
http://pubads.g.doubleclick.net/gampad/clk?id=60135031&iu=/4140/ostg.clktrk
_______________________________________________
Languagetool-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to