-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Matt Kettler writes:
> At 03:54 PM 2/15/2005, Austin Weidner wrote:
> >I get certain e-mail accounts that are old and JUST GET SPAM (no question
> >about it). I set up a script that takes e-mails from these accounts and feds
> >them in to sa-learn as SPAM.
> >
> >I have no HAM's right now, however I have plans to add at least a couple
> >hundred to bayes (that is the bare minimum, I believe).
> >
> >My question is: Is there anything wrong with doing this?
> 
> No.. I see nothing wrong with it.. I do this myself...  They're called 
> "spam traps" by most.
> 
> I also have some carefully guarded "ham traps" that I've carefully 
> subscribed to well-trusted industry newsletters, etc. I script-feed these 
> to sa-learn --ham. I also keep a rotating archive of all the learned mail, 
> so I can go through and review it for contamination.

Yes, this is one thing to watch out for.  If the spamtrap accounts include
some accounts that *were* previously active as user accounts, you
*really* need to monitor those for the occasional ham slipping in.

In the past, I've seen spamtrap accounts that get thousands of spams, but
still have a very old subscription to announcement mailing lists that
send maybe one mail every 2 months, for example.

- --j.

> I just use a simple interval cronjob to do this, and I have the individual 
> addresses all aliases into one "spam" and one "ham" account. You can also 
> add things like calls to razor-report, etc.
> 
> (Note I've re-named the mailboxes here with a search/replace. You should 
> too.. You don't want outsiders being able to recognize your spam trap or 
> ham trap accounts. You certainly don't want anything as predictable as 
> "[EMAIL PROTECTED]" as your ham trap.)
> 
> Here's a trimmed down version of my script (no warranties or claims it's 
> bug-free, etc. Just provided to give you some ideas)
> 
> #!/bin/sh
> cd /var/autolearn/
> 
> if [ -f /var/spool/mail/spam ]; then
>   echo learning spam mailbox - spam
>   mv /var/spool/mail/spam .
>   /usr/bin/sa-learn --spam --mbox spam
>   rm spam/spam.alearn6.gz
>   mv spam/spam.alearn5.gz spam/spam.alearn6.gz
>   mv spam/spam.alearn4.gz spam/spam.alearn5.gz
>   mv spam/spam.alearn3.gz spam/spam.alearn4.gz
>   mv spam/spam.alearn2.gz spam/spam.alearn3.gz
> 
>   gzip spam/spam.alearn1
>   mv spam/spam.alearn1.gz spam/spam.alearn2.gz
> 
>   mv spam spam/spam.alearn1
> fi
> 
> if [ -f /var/spool/mail/ham ]; then
>   echo learning ham mailbox - ham
>   mv /var/spool/mail/ham .
>   /usr/bin/sa-learn --ham --mbox ham
>   rm ham/ham.alearn6.gz
>   mv ham/ham.alearn5.gz ham/ham.alearn6.gz
>   mv ham/ham.alearn4.gz ham/ham.alearn5.gz
>   mv ham/ham.alearn3.gz ham/ham.alearn4.gz
>   mv ham/ham.alearn2.gz ham/ham.alearn3.gz
> 
>   gzip ham/ham.alearn1
>   mv ham/ham.alearn1.gz ham/ham.alearn2.gz
> 
>   mv ham ham/ham.alearn1
> fi
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.5 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFCEnOhMJF5cimLx9ARAtk0AJ4lMWErOwVHWtaC6surOO+VpgE8fgCgncOq
8TE+PQHMIaiP6B+HNzvRK+Q=
=XnOV
-----END PGP SIGNATURE-----

Reply via email to