On 2014-01-02 14:22, Daniel Naber wrote:
> On 2013-12-31 15:04, Marcin Miłkowski wrote:
>
>> I'd get rid of these multiple replacements. They are spurious anyway, I
>> guess. Some of them seem to be repeated (I just copied these from
>> hunspell). I will look at them again.
>
> Thanks for cleaning up the replacement pairs. As this might affect quite
> some users, I'd like to release a version 2.4.1 with this fix. Any
> objections? Any other changes that should go into 2.4.1? (as this
> doesn't affect LibreOffice/OO we won't need a new *.oxt unless we add
> other fixes)

I found out that we failed to give proper suggestions for all standard
contractions, so I added a special rule to handle them (instead of
changing the MorfologikSpeller rules), and errors in contractions are
ignored by the MorfologikSpeller rules as well. From the user's
perspective there's no difference (red highlight etc.), but it's much
easier for us to support.
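
To illustrate the idea only: a rough sketch of a lookup-based contraction
rule. This is not the actual LanguageTool code; the table contents and the
names CONTRACTIONS and suggest_contraction are made up for the example, and
I'm assuming a plain dictionary lookup is enough for the standard cases.

```python
# Hypothetical table of apostrophe-less forms and their standard contractions.
# Only unambiguous forms are listed here (no "hell", "well", "cant", ...).
CONTRACTIONS = {
    "dont": "don't",
    "doesnt": "doesn't",
    "didnt": "didn't",
    "isnt": "isn't",
    "wouldnt": "wouldn't",
}


def suggest_contraction(word):
    """Return the suggested contraction for a misspelled word, or None."""
    fixed = CONTRACTIONS.get(word.lower())
    if fixed is None:
        return None
    # Preserve an initial capital, e.g. at the start of a sentence.
    if word[0].isupper():
        return fixed[0].upper() + fixed[1:]
    return fixed
```

A dedicated table like this gives exactly one suggestion per contraction, and
the general speller can simply skip any word the table already covers.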

Also, tokenization has been changed for English to exclude a
word-ending hyphen from words. This makes spell-checking easier and
faster, as hyphens frequently occur at the border of English words
("two- and fourfold increase").
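
A small sketch of what splitting off a word-ending hyphen could look like.
Again, this is not the real English tokenizer; the regular expression and
the names split_trailing_hyphen and tokenize are assumptions made for the
example.

```python
import re


def split_trailing_hyphen(token):
    """Split a word-ending hyphen off a token, e.g. 'two-' -> ['two', '-']."""
    if len(token) > 1 and token.endswith("-") and token[-2].isalnum():
        return [token[:-1], "-"]
    return [token]


def tokenize(text):
    """Very simplified word tokenizer that detaches word-ending hyphens."""
    tokens = []
    # Words may contain apostrophes and inner hyphens; other punctuation
    # characters become single tokens; whitespace is dropped.
    for raw in re.findall(r"[\w'-]+|[^\w\s]", text):
        tokens.extend(split_trailing_hyphen(raw))
    return tokens
```

With this, "two- and fourfold increase" yields "two" and "-" as separate
tokens, so the speller only has to look up "two", while a word with an inner
hyphen such as "well-known" stays in one piece.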

It seems we have a lot of possible typos with contractions that might go 
undetected as they are ambiguous (for example "Hell go to Miami" instead 
of "He'll go to Miami"). New rules are needed to handle those.

I think we should test my changes (especially the contraction rule) on
the nightly diff. I don't think it should create too much noise, so
maybe it could be run on Wikipedia by default.

Best,
Marcin

_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
