At 03:54 PM 2/15/2005, Austin Weidner wrote:
I have certain e-mail accounts that are old and JUST GET SPAM (no question
about it). I set up a script that takes e-mails from these accounts and feeds
them into sa-learn as SPAM.

I have no ham right now, but I plan to add at least a couple hundred
messages to bayes (that is the bare minimum, I believe).

My question is: Is there anything wrong with doing this?

No, I see nothing wrong with it. I do this myself. Most people call these accounts "spam traps".


I also have some carefully guarded "ham traps": addresses that I've subscribed only to well-trusted industry newsletters and the like. I script-feed these to sa-learn --ham. I also keep a rotating archive of all the learned mail, so I can go back through it and check for contamination (spam that was learned as ham, or the reverse).
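As a rough sketch of what such a review could look like, the helper below skims the headers of one archived mailbox. The function name list_archive_headers is hypothetical, and it assumes the archives are gzip-compressed mbox files such as spam/spam.alearn2.gz.

```shell
#!/bin/sh
# list_archive_headers: hypothetical helper, not part of the original script.
# Prints every line of a gzip-compressed mbox file that begins with
# "From: " or "Subject: ". This is a plain line match, so a body line that
# happens to start the same way is printed too.
list_archive_headers() {
    gzip -dc "$1" | grep -E '^(From|Subject): '
}
```

Something like "list_archive_headers spam/spam.alearn2.gz | less" would then give a quick list to scan for mail that landed in the wrong corpus.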


I just run this from a cron job at a fixed interval, and I have the individual addresses all aliased into one "spam" and one "ham" account. You can also add things like calls to razor-report, etc.


(Note: I've renamed the mailboxes here with a search/replace. You should rename yours too. You don't want outsiders being able to recognize your spam trap or ham trap accounts, and you certainly don't want anything as predictable as "[EMAIL PROTECTED]" as your ham trap.)

Here's a trimmed-down version of my script (no warranties or claims that it's bug-free; it's just provided to give you some ideas):

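#!/bin/sh
# Learn from the spam-trap and ham-trap mailboxes, then rotate the archives.
cd /var/autolearn/ || exit 1

if [ -f /var/spool/mail/spam ]; then
 echo learning spam mailbox - spam
 # Move the mailbox to a working file. The name "spam" is already taken by
 # the archive directory spam/, so the working name spam.mbox is used here
 # (an illustrative choice, not from the original script).
 mv /var/spool/mail/spam spam.mbox
 /usr/bin/sa-learn --spam --mbox spam.mbox
 # Rotate the archives: drop the oldest, shift the rest up by one.
 rm -f spam/spam.alearn6.gz
 mv spam/spam.alearn5.gz spam/spam.alearn6.gz
 mv spam/spam.alearn4.gz spam/spam.alearn5.gz
 mv spam/spam.alearn3.gz spam/spam.alearn4.gz
 mv spam/spam.alearn2.gz spam/spam.alearn3.gz

 # Compress the previous run's mailbox; only the newest stays uncompressed.
 gzip spam/spam.alearn1
 mv spam/spam.alearn1.gz spam/spam.alearn2.gz

 mv spam.mbox spam/spam.alearn1
fi

if [ -f /var/spool/mail/ham ]; then
 echo learning ham mailbox - ham
 # Same steps as above, for the ham trap.
 mv /var/spool/mail/ham ham.mbox
 /usr/bin/sa-learn --ham --mbox ham.mbox
 rm -f ham/ham.alearn6.gz
 mv ham/ham.alearn5.gz ham/ham.alearn6.gz
 mv ham/ham.alearn4.gz ham/ham.alearn5.gz
 mv ham/ham.alearn3.gz ham/ham.alearn4.gz
 mv ham/ham.alearn2.gz ham/ham.alearn3.gz

 gzip ham/ham.alearn1
 mv ham/ham.alearn1.gz ham/ham.alearn2.gz

 mv ham.mbox ham/ham.alearn1
fi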
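
The rotation steps in the script above are the same for spam and ham, so they could also be written once as a loop. This is only a sketch: the function name rotate_archives and its three arguments are hypothetical, and it skips generations that do not exist yet (for example on the first few runs) instead of printing mv errors.

```shell
#!/bin/sh
# rotate_archives: hypothetical helper sketching the same rotation as a loop.
# $1 = archive directory, $2 = file prefix, $3 = number of generations to keep.
rotate_archives() {
    dir=$1
    prefix=$2
    keep=$3
    # Drop the oldest compressed generation, if present.
    rm -f "$dir/$prefix.alearn$keep.gz"
    # Shift generations keep-1 down to 2 up by one, skipping missing files.
    i=$((keep - 1))
    while [ "$i" -ge 2 ]; do
        if [ -f "$dir/$prefix.alearn$i.gz" ]; then
            mv "$dir/$prefix.alearn$i.gz" "$dir/$prefix.alearn$((i + 1)).gz"
        fi
        i=$((i - 1))
    done
    # Compress the newest uncompressed generation into slot 2.
    if [ -f "$dir/$prefix.alearn1" ]; then
        gzip "$dir/$prefix.alearn1"
        mv "$dir/$prefix.alearn1.gz" "$dir/$prefix.alearn2.gz"
    fi
}
```

With that in place, each block would call "rotate_archives spam spam 6" (or "rotate_archives ham ham 6") after sa-learn finishes, and then move the freshly learned mailbox into the .alearn1 slot as before.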





