On 2015-01-13 09:10, Marcin Miłkowski wrote:

>> I've removed about correct 1000 example sentences for German as they
>> were redundant, i.e. they just repeated the incorrect example and its
>> 'correction' attribute. Unless someone objects for their language, I
>> will do the same for all languages (the cleanup effect will probably 
>> be
>> much smaller for most other languages).
> 
> I usually use correct examples as sanity (regression) checks, so please
> keep this in mind.

How exactly do you do that? Do you extract the sentences from the XML 
first so you have plain text? In that case, we either keep those 
sentences or you would need to change the process a bit that extracts 
the sentences (building correct sentences from incorrect example plus 
its correction). In Polish, only about 60 sentences would be affected.

Regards
  Daniel


------------------------------------------------------------------------------
New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
GigeNET is offering a free month of service with a new server in Ashburn.
Choose from 2 high performing configs, both with 100TB of bandwidth.
Higher redundancy.Lower latency.Increased capacity.Completely compliant.
http://p.sf.net/sfu/gigenet
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to