I get certain e-mail accounts that are old and JUST GET SPAM (no question about it). I set up a script that takes e-mails from these accounts and feds them in to sa-learn as SPAM.
I have no HAM's right now, however I have plans to add at least a couple hundred to bayes (that is the bare minimum, I believe).
My question is: Is there anything wrong with doing this?
No.. I see nothing wrong with it.. I do this myself... They're called "spam traps" by most.
I also have some carefully guarded "ham traps" that I've carefully subscribed to well-trusted industry newsletters, etc. I script-feed these to sa-learn --ham. I also keep a rotating archive of all the learned mail, so I can go through and review it for contamination.
I just use a simple interval cronjob to do this, and I have the individual addresses all aliases into one "spam" and one "ham" account. You can also add things like calls to razor-report, etc.
(Note I've re-named the mailboxes here with a search/replace. You should too.. You don't want outsiders being able to recognize your spam trap or ham trap accounts. You certainly don't want anything as predictable as "[EMAIL PROTECTED]" as your ham trap.)
Here's a trimmed down version of my script (no warranties or claims it's bug-free, etc. Just provided to give you some ideas)
#!/bin/sh cd /var/autolearn/
if [ -f /var/spool/mail/spam ]; then echo learning spam mailbox - spam mv /var/spool/mail/spam . /usr/bin/sa-learn --spam --mbox spam rm spam/spam.alearn6.gz mv spam/spam.alearn5.gz spam/spam.alearn6.gz mv spam/spam.alearn4.gz spam/spam.alearn5.gz mv spam/spam.alearn3.gz spam/spam.alearn4.gz mv spam/spam.alearn2.gz spam/spam.alearn3.gz
gzip spam/spam.alearn1 mv spam/spam.alearn1.gz spam/spam.alearn2.gz
mv spam spam/spam.alearn1 fi
if [ -f /var/spool/mail/ham ]; then echo learning ham mailbox - ham mv /var/spool/mail/ham . /usr/bin/sa-learn --ham --mbox ham rm ham/ham.alearn6.gz mv ham/ham.alearn5.gz ham/ham.alearn6.gz mv ham/ham.alearn4.gz ham/ham.alearn5.gz mv ham/ham.alearn3.gz ham/ham.alearn4.gz mv ham/ham.alearn2.gz ham/ham.alearn3.gz
gzip ham/ham.alearn1 mv ham/ham.alearn1.gz ham/ham.alearn2.gz
mv ham ham/ham.alearn1 fi