I'm currently running spamassassin on my mailserver where my users primarily use IMAP to connect.
I want to set up two folders for each user, "spam_to_learn" and "ham_to_learn", so that when they get uncaught spam or caught ham, they can just move the message to the appropriate folder, and once per day my script will run to push the mbox file through sa-learn.
There's two things I'm concerned about:
1. the "hidden" imap message that always sits at the start of the mbox file. Based on my initial tests, it looks like that hidden message gets processed by sa-learn, which tells me token info about my mailserver etc. are getting inadvertently marked as spamy tokens. Is the the case, or should I be worried about it? If this is the case then I can write a routine to flush out the hidden message, but why bother if i don't need to...
2. Ham that got marked as spam. Since spam messages get altered so that the spammy message becomes an attachment to the spam info, when I get a ham that got caught as spam, I suspect that I need to move the "original" message to my ham folder, not the "altered" message? ie. If I move the altered message to the ham folder, the message will contain all of the spam info and the actual message will still be an attachement. the alternative is that they need to open the attachment to see the original ham message and then move THAT message into the ham folder. ???
thoughts?
regards,
Paul
Paul
I use a perl script to dig out the imap based emails one at a time and then push them through sa-learn. Saves having to have local mbox for the learning..
If you'd like a copy, email me off list and I'll forward it along with how to run the thing.
There's also a setting (usually placed in local.cf) to ignore the SA-headers whilst learning. This is..
bayes_ignore_header X-Spam bayes_ignore_header X-Spam-SpamCheck bayes_ignore_header X-Spam-SpamScore bayes_ignore_header X-Spam-Information
You'll need to alter these to your local settings..
-- Martin Hepworth Snr Systems Administrator Solid State Logic Tel: +44 (0)1865 842300
**********************************************************************
This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this email in error please notify the system manager.
This footnote confirms that this email message has been swept for the presence of computer viruses and is believed to be clean.
**********************************************************************
