Re: sa-learn and "Caught" spams

Kelson Thu, 28 Sep 2006 11:05:52 -0700

Daniel Staal wrote:

Depends on the setup. For instance, given the explanations above, I'll start a system to automatically learn from my 'checkspam' folder, but not my 'highspam' folder. I have procmail automatically sort my spam by score, so I can pay extra attention to low-scoring spam. (Which is more likely to be ham which was misplaced than the high-scoring spam.)
So, since I *already* have them separated out, I can avoid the double-check. ;)

But the final score alone doesn't determine whether something gets autolearned.

As Matt pointed out, there are a number of different factors, including the mix of head/body tests and the current Bayes score -- and it acts on what the score would have been if Bayes had been disabled.

So unless you've filtered on the "autolearn=(ham|spam|no)" tag in the X-Spam-Status header, you could be missing some high-scoring spam that hasn't already been learned.

You could probably filter your training folder to remove any messages where X-Spam-Status contains "autolearn=spam" (assuming, of course, that your server takes full control of that header). That should be relatively fast and cut down on the resources used to identify duplicates.
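
Something along these lines might do it. This is only a rough sketch in Python, not anything that ships with SpamAssassin: the function names autolearn_status and needs_training are made up for illustration, and it assumes the X-Spam-Status header on each message was written by your own server and can be trusted.

```python
import email
import re

# Matches the autolearn tag inside an X-Spam-Status header,
# e.g. "autolearn=spam", "autolearn=ham" or "autolearn=no".
AUTOLEARN_RE = re.compile(r"autolearn=(\w+)")


def autolearn_status(msg):
    """Return the autolearn value from X-Spam-Status, or None if absent."""
    header = str(msg.get("X-Spam-Status", ""))
    match = AUTOLEARN_RE.search(header)
    return match.group(1) if match else None


def needs_training(messages):
    """Keep only the messages that were not already autolearned as spam."""
    return [m for m in messages if autolearn_status(m) != "spam"]


if __name__ == "__main__":
    # Hypothetical sample message, just to show the call.
    raw = (
        "X-Spam-Status: Yes, score=6.1 required=5.0 autolearn=no\n"
        "Subject: example\n"
        "\n"
        "body\n"
    )
    msg = email.message_from_string(raw)
    print(autolearn_status(msg))
```

To run it over a real folder you could load the messages with Python's standard mailbox module and pass them to needs_training, then hand only the remaining messages to sa-learn.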


--
Kelson Vibber
SpeedGate Communications <www.speed.net>
