Re: Big in size, etc

2014-01-30 Thread Marcin Miłkowski
W dniu 2014-01-30 04:14, Kumara Bhikkhu pisze:
 Of what you listed, these are rightly flagged:

   * few in number
   * religious in nature
   * autocratic in nature,

 Removing the 2nd and 3rd token wouldn't cause any lost of meaning.

Well, in general, I'm skeptical about inclusion of in nature. In many 
instances, it means in fact, which is just a matter of stress (but not 
redundant or pompous at all). But there is a number of uses where it 
means by necessity or in essence, which argues that something 
essentially or necessarily has a certain property. This is proper usage, 
and your rule would conflate necessity judgments and actuality 
judgments, which is simply a logical fallacy.

Let me quote the passage from the Brown corpus:

The men who speculate on these institutions have, for the most part, 
come to at least one common conclusion: that many of the great 
enterprises and associations around which our democracy is formed are in 
themselves autocratic in nature, and possessed of power which can be 
used to frustrate the citizen who is trying to assert his individuality 
in the modern world.

Here, by saying autocratic, and not autocratic in nature, one would 
change the logical structure of the passage, and make the point 
ambiguous. The writer wanted to say that some apparently democratic 
institutions are autocratic in character, or in their real essence (so 
they are autocratic necessarily, not contingently). To apply the 
correction offered by BIG_IN_SIZE to this passage is to distort the 
logical structure of the passage.

I will probably remove 'nature' from the rule altogether.

Regards,
Marcin


 kb

 Marcin Miłkowski wrote thus at 01:33 AM 30-01-14:
 W dniu 2014-01-29 10:35, Kumara Bhikkhu pisze:
  Hope this is not too greedy.
 
  rule id=BIG_IN_SIZE name=Big in size, etc
  pattern
  token postag_regexp=yes postag=JJ|JJR|JJSexception
  regexp=yessimilar|alone/exceptionexception postag=VBN//token
  tokenin/token
  token
 
 regexp=yessize|duration|color|colour|number|shape|height|nature|length|weight/token
  /pattern
  messageA more concise phrase may lose no meaning and sound more
  powerful./message
  suggestion\1/suggestion
  shortPossible redundancy/short
  example type=correctThe man is big./example
  example type=correctHis speech was briefest./example
  example type=correctHe's absorbed in thought./example
  example correction=big type=incorrectThe man is markerbig in
  size/marker./example
  example correction=briefest type=incorrectHis speech was
  markerbriefest in duration/marker./example
  example correction=redder type=incorrectMy car is markerredder
  in color/marker./example
  example correction=few type=incorrectHer friends were markerfew
  in number/marker./example
  /rule

 Thanks!

 Yes, this is much better but we also match:

 rich in color

 few in number

 possible in Nature,

 corresponding in number

 religious in nature

 efficient in shape

 autocratic in nature,

 equal in size

 beautiful in color,

 next in size

 Out of 16 matches on Brown corpus, the useful ones were brown in
 color,tawny in color, two in number. I added the adjectives above
 as additional exceptions and we'll see how it performs. I'm not sure
 about 'few'. We'll probably see more matches in the nightly diff today.

 Regards,
 Marcin

 --
 WatchGuard Dimension instantly turns raw network data into actionable
 security intelligence. It gives you real-time visual feedback on key
 security issues and trends.  Skip the complicated setup - simply import
 a virtual appliance and go from zero to informed in seconds.
 http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk

 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel


 --
 WatchGuard Dimension instantly turns raw network data into actionable
 security intelligence. It gives you real-time visual feedback on key
 security issues and trends.  Skip the complicated setup - simply import
 a virtual appliance and go from zero to informed in seconds.
 http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk



 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel



--
WatchGuard Dimension instantly turns raw network data into actionable 
security intelligence. It gives you real-time visual feedback on key
security issues and trends.  Skip the complicated setup - simply import
a virtual appliance and go from zero to informed in seconds.
http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk

Re: Big in size, etc

2014-01-30 Thread Kumara Bhikkhu
ok. agreed.

Marcin Miłkowski wrote thus at 05:03 PM 30-01-14:

W dniu 2014-01-30 04:14, Kumara Bhikkhu pisze:
  Of what you listed, these are rightly flagged:

* few in number
* religious in nature
* autocratic in nature,

  Removing the 2nd and 3rd token wouldn't cause any lost of meaning.

Well, in general, I'm skeptical about inclusion of in nature. In many
instances, it means in fact, which is just a matter of stress (but not
redundant or pompous at all). But there is a number of uses where it
means by necessity or in essence, which argues that something
essentially or necessarily has a certain property. This is proper usage,
and your rule would conflate necessity judgments and actuality
judgments, which is simply a logical fallacy.

Let me quote the passage from the Brown corpus:

The men who speculate on these institutions have, for the most part,
come to at least one common conclusion: that many of the great
enterprises and associations around which our democracy is formed are in
themselves autocratic in nature, and possessed of power which can be
used to frustrate the citizen who is trying to assert his individuality
in the modern world.

Here, by saying autocratic, and not autocratic in nature, one would
change the logical structure of the passage, and make the point
ambiguous. The writer wanted to say that some apparently democratic
institutions are autocratic in character, or in their real essence (so
they are autocratic necessarily, not contingently). To apply the
correction offered by BIG_IN_SIZE to this passage is to distort the
logical structure of the passage.

I will probably remove 'nature' from the rule altogether.

Regards,
Marcin


  kb

  Marcin Miłkowski wrote thus at 011:33 AM 30-01-14:
  W dniu 2014-01-29 10:35, Kumara Bhikkhu pisze:
   Hope this is not too greedy.
  
   rule id=BIG_IN_SIZE name=Big in size, etc
   pattern
   token postag_regexp=yes postag=JJ|JJR|JJSexception
   regexp=yessimilar|alone/exceptionexception postag=VBN//token
   tokenin/token
   token
  
  
 regexp=yessize|duration|color|colour|number|shape|height|nature|length|weight/token
   /pattern
   messageA more concise phrase may lose no meaning and sound more
   powerful./message
   suggestion\1/suggestion
   shortPossible redundancy/short
   example type=correctThe man is big./example
   example type=correctHis speech was briefest./example
   example type=correctHe's absorbed in thought./example
   example correction=big type=incorrectThe man is markerbig in
   size/marker./example
   example correction=briefest type=incorrectHis speech was
   markerbriefest in duration/marker./example
   example correction=redder type=incorrectMy car is markerredder
   in color/marker./example
   example correction=few type=incorrectHer friends were markerfew
   in number/marker./example
   /rule
 
  Thanks!
 
  Yes, this is much better but we also match:
 
  rich in color
 
  few in number
 
  possible in Nature,
 
  corresponding in number
 
  religious in nature
 
  efficient in shape
 
  autocratic in nature,
 
  equal in size
 
  beautiful in color,
 
  next in size
 
  Out of 16 matches on Brown corpus, the useful ones were brown in
  color,tawny in color, two in number. I added the adjectives above
  as additional exceptions and we'll see how it performs. I'm not sure
  about 'few'. We'll probably see more matches in the nightly diff today.
 
  Regards,
  Marcin
 
  
 --
  WatchGuard Dimension instantly turns raw network data into actionable
  security intelligence. It gives you real-time visual feedback on key
  security issues and trends.  Skip the complicated setup - simply import
  a virtual appliance and go from zero to informed in seconds.
  
 http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk
 
  ___
  Languagetool-devel mailing list
  Languagetool-devel@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/languagetool-devel


  
 --
  WatchGuard Dimension instantly turns raw network data into actionable
  security intelligence. It gives you real-time visual feedback on key
  security issues and trends.  Skip the complicated setup - simply import
  a virtual appliance and go from zero to informed in seconds.
  
 http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk



  ___
  Languagetool-devel mailing list
  Languagetool-devel@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/languagetool-devel



--
WatchGuard Dimension instantly turns raw network data into actionable
security intelligence. It gives you real-time visual feedback on key
security issues and trends.  Skip the complicated setup - simply 

Re: Big in size, etc

2014-01-29 Thread Marcin Miłkowski
W dniu 2014-01-29 10:35, Kumara Bhikkhu pisze:
 Hope this is not too greedy.

 rule id=BIG_IN_SIZE name=Big in size, etc
 pattern
 token postag_regexp=yes postag=JJ|JJR|JJSexception
 regexp=yessimilar|alone/exceptionexception postag=VBN//token
 tokenin/token
 token
 regexp=yessize|duration|color|colour|number|shape|height|nature|length|weight/token
 /pattern
 messageA more concise phrase may lose no meaning and sound more
 powerful./message
 suggestion\1/suggestion
 shortPossible redundancy/short
 example type=correctThe man is big./example
 example type=correctHis speech was briefest./example
 example type=correctHe's absorbed in thought./example
 example correction=big type=incorrectThe man is markerbig in
 size/marker./example
 example correction=briefest type=incorrectHis speech was
 markerbriefest in duration/marker./example
 example correction=redder type=incorrectMy car is markerredder
 in color/marker./example
 example correction=few type=incorrectHer friends were markerfew
 in number/marker./example
 /rule

Thanks!

Yes, this is much better but we also match:

rich in color

few in number

possible in Nature,

corresponding in number

religious in nature

efficient in shape

autocratic in nature,

equal in size

beautiful in color,

next in size

Out of 16 matches on Brown corpus, the useful ones were brown in 
color,tawny in color, two in number. I added the adjectives above 
as additional exceptions and we'll see how it performs. I'm not sure 
about 'few'. We'll probably see more matches in the nightly diff today.

Regards,
Marcin

--
WatchGuard Dimension instantly turns raw network data into actionable 
security intelligence. It gives you real-time visual feedback on key
security issues and trends.  Skip the complicated setup - simply import
a virtual appliance and go from zero to informed in seconds.
http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Big in size, etc

2014-01-29 Thread Kumara Bhikkhu

Of what you listed, these are rightly flagged:
   * few in number
   * religious in nature
   * autocratic in nature,
Removing the 2nd and 3rd token wouldn't cause any lost of meaning.

kb

Marcin Miłkowski wrote thus at 01:33 AM 30-01-14:

W dniu 2014-01-29 10:35, Kumara Bhikkhu pisze:
 Hope this is not too greedy.

 rule id=BIG_IN_SIZE name=Big in size, etc
 pattern
 token postag_regexp=yes postag=JJ|JJR|JJSexception
 regexp=yessimilar|alone/exceptionexception postag=VBN//token
 tokenin/token
 token
 
regexp=yessize|duration|color|colour|number|shape|height|nature|length|weight/token

 /pattern
 messageA more concise phrase may lose no meaning and sound more
 powerful./message
 suggestion\1/suggestion
 shortPossible redundancy/short
 example type=correctThe man is big./example
 example type=correctHis speech was briefest./example
 example type=correctHe's absorbed in thought./example
 example correction=big type=incorrectThe man is markerbig in
 size/marker./example
 example correction=briefest type=incorrectHis speech was
 markerbriefest in duration/marker./example
 example correction=redder type=incorrectMy car is markerredder
 in color/marker./example
 example correction=few type=incorrectHer friends were markerfew
 in number/marker./example
 /rule

Thanks!

Yes, this is much better but we also match:

rich in color

few in number

possible in Nature,

corresponding in number

religious in nature

efficient in shape

autocratic in nature,

equal in size

beautiful in color,

next in size

Out of 16 matches on Brown corpus, the useful ones were brown in
color,tawny in color, two in number. I added the adjectives above
as additional exceptions and we'll see how it performs. I'm not sure
about 'few'. We'll probably see more matches in the nightly diff today.

Regards,
Marcin

--
WatchGuard Dimension instantly turns raw network data into actionable
security intelligence. It gives you real-time visual feedback on key
security issues and trends.  Skip the complicated setup - simply import
a virtual appliance and go from zero to informed in seconds.
http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
--
WatchGuard Dimension instantly turns raw network data into actionable 
security intelligence. It gives you real-time visual feedback on key
security issues and trends.  Skip the complicated setup - simply import
a virtual appliance and go from zero to informed in seconds.
http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel