Hi, we're using a bit probability theory to calculate ngram probabilities. This way we can decide which word of a homophone pair like there/their is (probably) correct. Is anybody here familiar with probability theory and could review that code? The main part is here:
https://github.com/languagetool-org/languagetool/blob/master/languagetool-core/src/main/java/org/languagetool/rules/ConfusionProbabilityRule.java#L309 and line 268 There are other issues where I could need help from an expert. For example, results don't get better when we use 4grams instead of 3grams. Regards Daniel ------------------------------------------------------------------------------ _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel