-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Matt Kettler writes: > At 03:54 PM 2/15/2005, Austin Weidner wrote: > >I get certain e-mail accounts that are old and JUST GET SPAM (no question > >about it). I set up a script that takes e-mails from these accounts and feds > >them in to sa-learn as SPAM. > > > >I have no HAM's right now, however I have plans to add at least a couple > >hundred to bayes (that is the bare minimum, I believe). > > > >My question is: Is there anything wrong with doing this? > > No.. I see nothing wrong with it.. I do this myself... They're called > "spam traps" by most. > > I also have some carefully guarded "ham traps" that I've carefully > subscribed to well-trusted industry newsletters, etc. I script-feed these > to sa-learn --ham. I also keep a rotating archive of all the learned mail, > so I can go through and review it for contamination. Yes, this is one thing to watch out for. If the spamtrap accounts include some accounts that *were* previously active as user accounts, you *really* need to monitor those for the occasional ham slipping in. In the past, I've seen spamtrap accounts that get thousands of spams, but still have a very old subscription to announcement mailing lists that send maybe one mail every 2 months, for example. - --j. > I just use a simple interval cronjob to do this, and I have the individual > addresses all aliases into one "spam" and one "ham" account. You can also > add things like calls to razor-report, etc. > > (Note I've re-named the mailboxes here with a search/replace. You should > too.. You don't want outsiders being able to recognize your spam trap or > ham trap accounts. You certainly don't want anything as predictable as > "[EMAIL PROTECTED]" as your ham trap.) > > Here's a trimmed down version of my script (no warranties or claims it's > bug-free, etc. Just provided to give you some ideas) > > #!/bin/sh > cd /var/autolearn/ > > if [ -f /var/spool/mail/spam ]; then > echo learning spam mailbox - spam > mv /var/spool/mail/spam . > /usr/bin/sa-learn --spam --mbox spam > rm spam/spam.alearn6.gz > mv spam/spam.alearn5.gz spam/spam.alearn6.gz > mv spam/spam.alearn4.gz spam/spam.alearn5.gz > mv spam/spam.alearn3.gz spam/spam.alearn4.gz > mv spam/spam.alearn2.gz spam/spam.alearn3.gz > > gzip spam/spam.alearn1 > mv spam/spam.alearn1.gz spam/spam.alearn2.gz > > mv spam spam/spam.alearn1 > fi > > if [ -f /var/spool/mail/ham ]; then > echo learning ham mailbox - ham > mv /var/spool/mail/ham . > /usr/bin/sa-learn --ham --mbox ham > rm ham/ham.alearn6.gz > mv ham/ham.alearn5.gz ham/ham.alearn6.gz > mv ham/ham.alearn4.gz ham/ham.alearn5.gz > mv ham/ham.alearn3.gz ham/ham.alearn4.gz > mv ham/ham.alearn2.gz ham/ham.alearn3.gz > > gzip ham/ham.alearn1 > mv ham/ham.alearn1.gz ham/ham.alearn2.gz > > mv ham ham/ham.alearn1 > fi -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.5 (GNU/Linux) Comment: Exmh CVS iD8DBQFCEnOhMJF5cimLx9ARAtk0AJ4lMWErOwVHWtaC6surOO+VpgE8fgCgncOq 8TE+PQHMIaiP6B+HNzvRK+Q= =XnOV -----END PGP SIGNATURE-----