On 2015-01-13 09:10, Marcin Miłkowski wrote:
>> I've removed about 1000 correct example sentences for German as they
>> were redundant, i.e. they just repeated the incorrect example and its
>> 'correction' attribute. Unless someone objects for their language, I
>> will do the same for all languages (the cleanup effect will probably
>> be much smaller for most other languages).
>
> I usually use correct examples as sanity (regression) checks, so please
> keep this in mind.

How exactly do you do that? Do you extract the sentences from the XML
first so you have plain text? In that case, we either keep those
sentences, or you would need to change the extraction process a bit so
that it builds the correct sentences from each incorrect example plus its
correction. In Polish, only about 60 sentences would be affected.

Regards
 Daniel

_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
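
To make the second option concrete, here is a rough sketch of how an extraction script might build correct sentences from an incorrect example plus its correction. It is only an illustration, not the actual extraction process: it assumes each incorrect example is an `<example>` element with a `correction` attribute and a single `<marker>` child around the error, and that alternative corrections are separated by `|`. The function names `build_correct_sentences` and `corrected_examples` are hypothetical.

```python
import xml.etree.ElementTree as ET


def build_correct_sentences(example):
    """Build corrected sentences from one <example> element.

    Assumption: the element has a 'correction' attribute and exactly one
    <marker> child wrapping the erroneous text. Returns an empty list for
    examples that do not fit that shape (e.g. plain correct examples).
    """
    correction = example.get("correction")
    marker = example.find("marker")
    if correction is None or marker is None:
        return []
    before = example.text or ""   # text preceding <marker>
    after = marker.tail or ""     # text following </marker>
    # Assumption: '|' separates alternative corrections.
    return [before + alt + after for alt in correction.split("|")]


def corrected_examples(xml_text):
    """Collect corrected sentences for every <example> in an XML snippet."""
    root = ET.fromstring(xml_text)
    sentences = []
    for example in root.iter("example"):
        sentences.extend(build_correct_sentences(example))
    return sentences


# Hypothetical sample data, made up for this sketch.
SAMPLE = """<rule id="DEMO">
  <example type="incorrect" correction="their">I saw <marker>there</marker> house.</example>
  <example type="correct">I saw their house.</example>
  <example type="incorrect" correction="a lot|much">Thanks <marker>alot</marker>.</example>
</rule>"""

if __name__ == "__main__":
    for sentence in corrected_examples(SAMPLE):
        print(sentence)
```

The resulting plain-text sentences could then be fed to the same regression check as before, so removing the redundant correct examples from the XML would not lose any coverage.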