Re[4]: [sniffer] Charset

2004-08-20 Thread Pete McNeil
On Friday, August 20, 2004, 2:35:35 AM, Michiel wrote:

MP Pete, even your message had a chaset header:

MP Content-Type: text/plain; charset=us-ascii

Yes, a tricky gadget indeed.

MP I think you'll generate more FP's if you do something like that than FN's
MP you might have now. Aren't there spamassassin config files that detect this
MP spam?

Just to be clear - we're not precisely talking about spam per-se.
Rather we're talking about stating that all traffic on a particular
system should be only in one language as a matter of policy...

The distinction is small I suppose, but in my mind important. In
filtering spam we're usually trying to target only messages that are
unsolicited commercial email, pornography, or somehow harmful... With
this other approach instead of trying to defeat what we don't want, we
are trying to only accept what we do want... Not so much putting up
blocks, more like putting up a huge block and punching holes.

There are some SA filters that do this kind of thing...
Ultimately I think it boils down to filtering out anything with a
charset that is not wanted.

If we achieve this by attrition (rather than attempting to capture all
of the charsets at once) then we will achieve a strong result quickly
at a relatively low cost and we might avoid potential false positives
that are out there.

MHO,
_M




This E-Mail came from the Message Sniffer mailing list. For information and 
(un)subscription instructions go to 
http://www.sortmonster.com/MessageSniffer/Help/Help.html


Re: Re[4]: [sniffer] Charset

2004-08-20 Thread Scott Fisher
-Mad,

How set up is Message Sniffer to determine if an e-mail in a foreign
language is spam and then code for it.
I dutifully submit my Spanish spam to the spam at sortmonster.com address.
It's a very, very small percentage of my overall spam, but it consistently
lands in my battleground grey-weight ranges.

I only ask, because I have seen the amount of non-English spam trending
upwards. I've noticed spam here in Russian, German, Spanish, Korean,
Portuguese and Chinese.

- Original Message - 
From: Pete McNeil [EMAIL PROTECTED]
To: Michiel Prins [EMAIL PROTECTED]
Sent: Friday, August 20, 2004 7:04 AM
Subject: Re[4]: [sniffer] Charset


 On Friday, August 20, 2004, 2:35:35 AM, Michiel wrote:

 MP Pete, even your message had a chaset header:

 MP Content-Type: text/plain; charset=us-ascii

 Yes, a tricky gadget indeed.

 MP I think you'll generate more FP's if you do something like that than
FN's
 MP you might have now. Aren't there spamassassin config files that detect
this
 MP spam?

 Just to be clear - we're not precisely talking about spam per-se.
 Rather we're talking about stating that all traffic on a particular
 system should be only in one language as a matter of policy...

 The distinction is small I suppose, but in my mind important. In
 filtering spam we're usually trying to target only messages that are
 unsolicited commercial email, pornography, or somehow harmful... With
 this other approach instead of trying to defeat what we don't want, we
 are trying to only accept what we do want... Not so much putting up
 blocks, more like putting up a huge block and punching holes.

 There are some SA filters that do this kind of thing...
 Ultimately I think it boils down to filtering out anything with a
 charset that is not wanted.

 If we achieve this by attrition (rather than attempting to capture all
 of the charsets at once) then we will achieve a strong result quickly
 at a relatively low cost and we might avoid potential false positives
 that are out there.

 MHO,
 _M




 This E-Mail came from the Message Sniffer mailing list. For information
and (un)subscription instructions go to
http://www.sortmonster.com/MessageSniffer/Help/Help.html





This E-Mail came from the Message Sniffer mailing list. For information and 
(un)subscription instructions go to 
http://www.sortmonster.com/MessageSniffer/Help/Help.html