Re: updated ngram data

2015-08-17 Thread Daniel Naber
On 2015-08-17 17:57, Andre Couture wrote: > (default task-2) Format version is not supported (resource > MMapIndexInput(path="/google-ngram-index/3grams/_1e4_Lucene50_0.tim")): > 1 (needs to be between 0 and 0) > > (default task-2) RuleMatch error unhandled, type =Typographical > > Do I simply n

Re: updated ngram data

2015-08-17 Thread Andre Couture
Hi, I have downloaded the new zip and expanded but seem that (default task-2) Using LanguageTool 3.1-SNAPSHOT (LanguageTool-20150713-snapshot) (default task-2) Supported languages: 45 (default task-2) Supported rules: 1277 (default task-2) Format version is not supported (resource MMapIndexInpu

updated ngram data

2015-08-17 Thread Daniel Naber
Hi, I've updated our ngram data that is used to power our English homophone confusion rule. This is the rule that finds many cases where words like there/their, breathe/breath etc are confused. The new ngram data is based on the Google ngram data from 2012 and it's much larger than the previou