Hi,

I tried creating an fsa dictionary with frequency information from my
German word list[1] as detailed on the Wiki[2].

The creation process apparently worked, but when I tried to dump
the binary dictionary to a list with the command

java -cp languagetool.jar org.languagetool.dev.DictionaryExporter de-DE.dict

I got the following error in LT 2.7:

Unhandled program error occurred.
Invoke with '--help' for help.
java.lang.IndexOutOfBoundsException
        at java.nio.Buffer.checkBounds(Buffer.java:559)
        at java.nio.HeapByteBuffer.put(HeapByteBuffer.java:181)
        at morfologik.stemming.DictionaryLookup.decodeBaseForm(DictionaryLookup.java:312)
        at morfologik.stemming.DictionaryIterator.next(DictionaryIterator.java:95)
        at morfologik.stemming.DictionaryIterator.next(DictionaryIterator.java:15)
        at morfologik.tools.FSADumpTool.dump(FSADumpTool.java:171)
        at morfologik.tools.FSADumpTool.go(FSADumpTool.java:75)
        at morfologik.tools.Tool.go(Tool.java:45)
        at morfologik.tools.FSADumpTool.main(FSADumpTool.java:285)
        at org.languagetool.dev.DictionaryExporter.main(DictionaryExporter.java:41)
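
A note on reading the trace: the two top frames are JDK code, and as far
as I can tell they mean that ByteBuffer.put(byte[], int, int) was called
with an offset/length pair that does not fit inside the source array. The
sketch below only illustrates that JDK behaviour. It does not use
Morfologik, and the class name PutBoundsDemo and the helper tryPut are
made up for this example; what decodeBaseForm actually passes in is not
something I have verified.

```java
import java.nio.ByteBuffer;

public class PutBoundsDemo {

    /**
     * Hypothetical helper: copies `length` bytes starting at `offset` from a
     * source array of `srcLen` bytes into a 16-byte buffer and reports what
     * happened ("ok" or the simple name of the exception that was thrown).
     */
    static String tryPut(int srcLen, int offset, int length) {
        ByteBuffer dst = ByteBuffer.allocate(16);
        byte[] src = new byte[srcLen];
        try {
            dst.put(src, offset, length);
            return "ok";
        } catch (RuntimeException e) {
            return e.getClass().getSimpleName();
        }
    }

    public static void main(String[] args) {
        // Range fits inside the source array and the destination buffer.
        System.out.println(tryPut(4, 0, 4));
        // offset + length runs past the end of the source array.
        System.out.println(tryPut(4, 2, 4));
        // A negative length is rejected by the same bounds check.
        System.out.println(tryPut(4, 0, -1));
        // Range is valid for the source, but the destination is too small.
        System.out.println(tryPut(32, 0, 32));
    }
}
```

The last call is there for contrast: a destination buffer that is too
small fails with BufferOverflowException, not with the
IndexOutOfBoundsException seen above. So the trace seems to point at a
wrong offset or length computed while decoding an entry, rather than at
an undersized output buffer.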

The binary dictionary with frequency data that I created can be found at
[3].

Best,
Jan

[1] http://sourceforge.net/projects/germandict/
[2] http://wiki.languagetool.org/hunspell-support
[3] http://sourceforge.net/projects/germandict/files/Morfologik/

On 11.10.2014 12:00, Daniel Naber wrote:
> Hi,
> 
> to provide LT as a 100% pure Java software, I'd like to switch from 
> Hunspell (native code) to Morfologik (Java-based). For that, I think the 
> following languages are easy to switch:
> 
>      Asturian
>      Galician
>      Khmer
>      Spanish
>      Tagalog
>      Esperanto
>      Icelandic
> 
> Does anybody see a problem with me switching those languages to 
> Morfologik? For Esperanto and Icelandic this would also have the benefit 
> that they can then offer suggestions for typos.
> 
> Other languages (fr, de, sv, pt-BR) might not be easy to switch; please see
> https://github.com/languagetool-org/languagetool/issues/199 for details.
> 
> Regards
>   Daniel
> 
> 
> ------------------------------------------------------------------------------
> Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer
> Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports
> Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper
> Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer
> http://p.sf.net/sfu/Zoho
> _______________________________________________
> Languagetool-devel mailing list
> Languagetool-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/languagetool-devel
> 
