Hi,

I wonder if we could use the switch to Morfologik as an opportunity to
rethink our general approach to dictionaries.

Currently we use two dictionaries for all the fully supported languages
afaik, and those contribute considerably to the large download size of
LanguageTool.

Why not use just one dictionary per language and keep all the necessary
data in one well organised place? This large word database could contain
everything we need: tags and base form for the grammar checking,
frequency information for the spelling suggestions, just about
everything. Even if we want the dictionary to contain incorrect/outdated
spellings for tagging purposes, all we need is a one-bit flag that tells
the spell-checking routine if a word is misspelled.

Cheers,
Jan


Am 11.10.2014 12:00, schrieb Daniel Naber:
> Hi,
> 
> to provide LT as a 100% pure Java software, I'd like to switch from 
> Hunspell (native code) to Morfologik (Java-based).

------------------------------------------------------------------------------
Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer
Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports
Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper
Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer
http://p.sf.net/sfu/Zoho
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to