Hi,

we're using a bit probability theory to calculate ngram probabilities. 
This way we can decide which word of a homophone pair like there/their 
is (probably) correct. Is anybody here familiar with probability theory 
and could review that code? The main part is here:

https://github.com/languagetool-org/languagetool/blob/master/languagetool-core/src/main/java/org/languagetool/rules/ConfusionProbabilityRule.java#L309
 
and line 268

There are other issues where I could need help from an expert. For 
example, results don't get better when we use 4grams instead of 3grams.

Regards
  Daniel


------------------------------------------------------------------------------
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to