Hi

Why aren't the scripts in of all binary dictionaries in Git?
They are useful when dictionaries need to be upgraded.
And they help maintainers of other languages to figure out
how to their dictionaries. A script is less ambiguous than
documentation.

I have not created a synthesizer dictionary yet, but
the POS and FSA for French and Breton
are created from scripts in Git:

languagetool-language-modules/fr/src/main/resources/org/languagetool/resource/fr/create-lexicon.sh
languagetool-language-modules/br/src/main/resources/org/languagetool/resource/br/create-lexicon.pl
languagetool-language-modules/br/src/main/resources/org/languagetool/resource/br/hunspell/create-fsa-spell-dictionary.sh

In fact, I saw that one of the reasons for not packaging
LanguageTool in Debian, is because we don't automate
the creation of binary dictionaries:

https://bugs.launchpad.net/ubuntu/+source/openoffice.org/+bug/114375

Regards
Dominique

Jaume Ortolà i Font wrote:

> Daniel,
>
> I found the same problem recently. I resorted to the attached perl script
> for this step.
>
> Regards,
> Jaume Ortolà
>
> 2013/10/4 Daniel Naber <[email protected]>
>>
>> Hi,
>>
>> did anybody recently build a synthesizer? When I follow the instructions
>> at http://wiki.languagetool.org/developing-a-tagger-dictionary#toc8 I
>> get messages like this:
>>
>> Line number 1 has less than 3 tab-separated fields: zzzz|I      zzzz
>>
>> So what's the correct input format for the tab2morph step? When I use a
>> format with three tab-separated columns the synth dict I get is very
>> large (10MB).
>>
>> Regards
>>   Daniel
>>
>> --
>> http://www.danielnaber.de

------------------------------------------------------------------------------
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register >
http://pubads.g.doubleclick.net/gampad/clk?id=60134791&iu=/4140/ostg.clktrk
_______________________________________________
Languagetool-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to