Hi;

I have been following this discussion, and it seems like this would be a
good test for a weighting system.  Some observations that could complement it:

1:  Mailing list email addresses are long.  I have not seen autogenerated
addresses that are shorter than 10 or so characters.  E.g.

[EMAIL PROTECTED] [64.241.105.8]

[EMAIL PROTECTED]

On the other hand, spam-like addresses are typically about 10 or so
characters long.  I think it is worth looking into John's suggestion with
the UserID length taken into consideration.  E.g. from last night's logs:

[EMAIL PROTECTED]
[EMAIL PROTECTED]
[EMAIL PROTECTED]
[EMAIL PROTECTED]
[EMAIL PROTECTED]

I think we can use the length of the UserID to our advantage in implementing
this test.

2:  I wish we could run tests on UserID and domain separately.  It seems
like it would be much easier if the domain could be separated from the
UserID since for example one could test for two dashes (--) in the domain.
We are getting more and more spam with domains like hot--stuff.com.
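
To make the idea concrete, here is a rough sketch of what testing the UserID
and the domain separately could look like.  This is plain Python, not Declude
configuration syntax; the function names are made up for illustration, and it
assumes a simple user@domain address with nothing else around it.

```python
def split_address(address):
    """Split 'user@domain' into (user_id, domain) at the last '@'.

    Hypothetical helper: assumes a bare address with no display name
    or angle brackets around it.
    """
    user_id, sep, domain = address.rpartition("@")
    if not sep or not user_id or not domain:
        raise ValueError(f"not a simple user@domain address: {address!r}")
    return user_id, domain.lower()


def domain_has_double_dash(address):
    """True if the domain part (and only the domain part) contains '--'."""
    _, domain = split_address(address)
    return "--" in domain


def user_id_length(address):
    """Length of the UserID (the part before the '@')."""
    user_id, _ = split_address(address)
    return len(user_id)
```

With the two parts separated, a double dash in the UserID no longer triggers
the domain test, and the UserID length can be weighted on its own.  Any
length cutoff would have to come from looking at real logs.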

Regards,
Kami


-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Markus Gufler
Sent: Thursday, September 11, 2003 7:16 AM
To: [EMAIL PROTECTED]
Subject: RE: [Declude.JunkMail] New test request


> > How about a test like this:
> > NUMBERSINMAILFROM
> > 
> > It would be similar to SUBJECTSPACES but would count the amount of
> > numbers in the mail from address. You could then configure 
> > it for say if 10 or more,
> > add 5 to the weight and so forth.

John,

We already look for sender addresses containing more than 4
(SenderWithCodeMaybe) or more than 8 digits (SenderWithCode). So we count
around 75% of spam-senders and 25% of FPs.
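
As a rough illustration only, the two digit-count checks could be sketched
like this in Python.  The test names and the "more than 4" / "more than 8"
thresholds are the ones given above; the function names and the choice to
count digits over the whole sender address are assumptions, and this is not
how the tests are actually configured in Declude.

```python
MAYBE_THRESHOLD = 4  # more than 4 digits -> SenderWithCodeMaybe
CODE_THRESHOLD = 8   # more than 8 digits -> SenderWithCode


def count_digits(sender):
    """Count ASCII digits 0-9 anywhere in the sender address."""
    return sum(ch in "0123456789" for ch in sender)


def classify_sender(sender):
    """Return the name of the test that fires, or None if neither does."""
    digits = count_digits(sender)
    if digits > CODE_THRESHOLD:
        return "SenderWithCode"
    if digits > MAYBE_THRESHOLD:
        return "SenderWithCodeMaybe"
    return None
```

In a weighting system each result would add its own configured weight
rather than reject the message outright.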

As Scott said, there are a lot of typical freemailer addresses like
"[EMAIL PROTECTED]" that create FPs with such a test. But there are also
auto-generated mailings that have a sender address like
"[EMAIL PROTECTED]".

On a typical day we see around 10% of all incoming messages having
between 4 and 7 digits. Another ~8% of incoming messages have more than 8
digits.

It's not the best, but it is definitely a useful test in a weighting system.


Markus


---
[This E-mail was scanned for viruses by Declude Virus
(http://www.declude.com)]

---
This E-mail came from the Declude.JunkMail mailing list.  To unsubscribe,
just send an E-mail to [EMAIL PROTECTED], and type "unsubscribe
Declude.JunkMail".  The archives can be found at
http://www.mail-archive.com.
