W dniu 2014-01-30 04:14, Kumara Bhikkhu pisze: > Of what you listed, these are rightly flagged: > > * few in number > * religious in nature > * autocratic in nature, > > Removing the 2nd and 3rd token wouldn't cause any lost of meaning.
Well, in general, I'm skeptical about inclusion of "in nature". In many instances, it means "in fact", which is just a matter of stress (but not redundant or pompous at all). But there is a number of uses where it means "by necessity" or "in essence", which argues that something essentially or necessarily has a certain property. This is proper usage, and your rule would conflate necessity judgments and actuality judgments, which is simply a logical fallacy. Let me quote the passage from the Brown corpus: The men who speculate on these institutions have, for the most part, come to at least one common conclusion: that many of the great enterprises and associations around which our democracy is formed are in themselves autocratic in nature, and possessed of power which can be used to frustrate the citizen who is trying to assert his individuality in the modern world. Here, by saying "autocratic", and not "autocratic in nature", one would change the logical structure of the passage, and make the point ambiguous. The writer wanted to say that some apparently democratic institutions are autocratic in character, or in their real essence (so they are autocratic necessarily, not contingently). To apply the "correction" offered by BIG_IN_SIZE to this passage is to distort the logical structure of the passage. I will probably remove 'nature' from the rule altogether. Regards, Marcin > > kb > > Marcin Miłkowski wrote thus at 01:33 AM 30-01-14: >> W dniu 2014-01-29 10:35, Kumara Bhikkhu pisze: >> > Hope this is not too greedy. >> > >> > <rule id="BIG_IN_SIZE" name="Big in size, etc"> >> > <pattern> >> > <token postag_regexp="yes" postag="JJ|JJR|JJS"><exception >> > regexp="yes">similar|alone</exception><exception postag="VBN"/></token> >> > <token>in</token> >> > <token >> > >> regexp="yes">size|duration|color|colour|number|shape|height|nature|length|weight</token> >> > </pattern> >> > <message>A more concise phrase may lose no meaning and sound more >> > powerful.</message> >> > <suggestion>\1</suggestion> >> > <short>Possible redundancy</short> >> > <example type="correct">The man is big.</example> >> > <example type="correct">His speech was briefest.</example> >> > <example type="correct">He's absorbed in thought.</example> >> > <example correction="big" type="incorrect">The man is <marker>big in >> > size</marker>.</example> >> > <example correction="briefest" type="incorrect">His speech was >> > <marker>briefest in duration</marker>.</example> >> > <example correction="redder" type="incorrect">My car is <marker>redder >> > in color</marker>.</example> >> > <example correction="few" type="incorrect">Her friends were <marker>few >> > in number</marker>.</example> >> > </rule> >> >> Thanks! >> >> Yes, this is much better but we also match: >> >> rich in color >> >> few in number >> >> possible in Nature, >> >> corresponding in number >> >> religious in nature >> >> efficient in shape >> >> autocratic in nature, >> >> equal in size >> >> beautiful in color, >> >> next in size >> >> Out of 16 matches on Brown corpus, the useful ones were "brown in >> color","tawny in color", "two in number". I added the adjectives above >> as additional exceptions and we'll see how it performs. I'm not sure >> about 'few'. We'll probably see more matches in the nightly diff today. >> >> Regards, >> Marcin >> >> ------------------------------------------------------------------------------ >> WatchGuard Dimension instantly turns raw network data into actionable >> security intelligence. It gives you real-time visual feedback on key >> security issues and trends. Skip the complicated setup - simply import >> a virtual appliance and go from zero to informed in seconds. >> http://pubads.g.doubleclick.net/gampad/clk?id=123612991&iu=/4140/ostg.clktrk >> >> _______________________________________________ >> Languagetool-devel mailing list >> Languagetool-devel@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/languagetool-devel > > > ------------------------------------------------------------------------------ > WatchGuard Dimension instantly turns raw network data into actionable > security intelligence. It gives you real-time visual feedback on key > security issues and trends. Skip the complicated setup - simply import > a virtual appliance and go from zero to informed in seconds. > http://pubads.g.doubleclick.net/gampad/clk?id=123612991&iu=/4140/ostg.clktrk > > > > _______________________________________________ > Languagetool-devel mailing list > Languagetool-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/languagetool-devel > ------------------------------------------------------------------------------ WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991&iu=/4140/ostg.clktrk _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel