On Sun, 2009-09-13 at 20:57 +0100, RW wrote:
> On Sun, 13 Sep 2009 14:19:35 +0100
> Clunk Werclick <mailbacku...@googlemail.com> wrote:
>
> > On Sun, 2009-09-13 at 14:06 +0100, RW wrote:
> > > On Sun, 13 Sep 2009 06:56:27 +0100
> > > Clunk Werclick <mailbacku...@googlemail.com> wrote:
> > >
> > {trimmed down to the relevant point you make}
> > > Adding irrelevant text to a spam may make it less likely to
> > > be caught,
> > Thank you. So if your bayes 'good' tokens that happen to catch on this
> > 'irrelevant' text, the result of having the bayes is near pointless.
> > For example, something like this:
>
> In practice I find it doesn't make much difference unless the spammer
> makes a significant effort to reduce the number of spammy tokens, both
> in the headers and the body. And that commonly leads them into hitting
> other rules, and constrains the number of spams that can be sent from
> the same IP address. The majority of the spams I get don't have such
> text and most that do still hit BAYES_99. It's obviously not such a
> powerful technique as you think.
>
>
> It's also wrong to assume that when spam hits BAYES_50, BAYES hasn't
> done anything useful. This is a fallacy that comes from the arbitrary
> assignment of zero to BAYES_50. If you add 2.599 to all the BAYES rules
> and then multiply all the rule scores by 0.658 you get an equivalent
> scoreset (i.e. one that produces the same classifications) in which
> zero is assigned to BAYES_00 instead. We then have:
>
> BAYES_00   0.00
> BAYES_50   1.71
> BAYES_99   4.01
>
> In this scoreset BAYES_50 actually looks like a fairly strong result
> (which it is).
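The rescaling RW describes in the quoted text can be checked with a few lines of Python. This is only a sketch under stated assumptions: the original BAYES scores (-2.599, 0.0, 3.5) are inferred by working backwards from the figures in the message, the threshold of 5.0 is an assumed required score, and it presumes exactly one BAYES_* rule fires per message. The function names are made up for illustration and are not part of SpamAssassin.

```python
# Constants taken from the message: shift the BAYES rules, then scale everything.
SHIFT = 2.599
SCALE = 0.658

# Assumed original scores, inferred from the rescaled figures in the message.
ORIGINAL_BAYES = {"BAYES_00": -2.599, "BAYES_50": 0.0, "BAYES_99": 3.5}


def rescale_bayes(score):
    """Add the shift to a BAYES score, then apply the common scale factor."""
    return (score + SHIFT) * SCALE


def is_spam(other_scores, bayes_score, threshold=5.0):
    """Classify with the original scoreset (threshold is an assumption)."""
    return sum(other_scores) + bayes_score >= threshold


def is_spam_rescaled(other_scores, bayes_score, threshold=5.0):
    """Classify with the shifted and scaled scoreset.

    Non-BAYES rules are only scaled. Because exactly one BAYES rule is
    assumed to fire, the shift adds the same constant to every message,
    so the threshold moves by the same shift and scale.
    """
    scaled_other = sum(s * SCALE for s in other_scores)
    scaled_threshold = (threshold + SHIFT) * SCALE
    return scaled_other + rescale_bayes(bayes_score) >= scaled_threshold


if __name__ == "__main__":
    for name, score in ORIGINAL_BAYES.items():
        print(name, round(rescale_bayes(score), 2))
    # Both scoresets should agree on every message.
    for others in ([1.0], [1.2, 1.8], [6.0]):
        for score in ORIGINAL_BAYES.values():
            assert is_spam(others, score) == is_spam_rescaled(others, score)
```

With these particular constants the rescaled threshold works out to (5.0 + 2.599) * 0.658, roughly 5.0001, so the required score is effectively unchanged while zero moves from BAYES_50 to BAYES_00.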

OK, I won't dismiss it out of hand and I'm open to observation. So, I'll
give bayes a whirl. I must confess that I found the documentation on the
simple act of enabling it less than ideal and reference to
"use_bayes_rules" is currently missing in action. I've cobbled together:

    # Enable the Bayes system
    use_bayes 1
    use_bayes_rules 0
    bayes_path /home/mail/bayes/bayes
    bayes_file_mode 0777
    # Enable Bayes auto-learning
    bayes_auto_learn 0

And trained some spam and I'll see how we get on.

-- 
-----------------------------------------------------------
C Werclick .Lot
Technical incompetent
Loyal Order Of The Teapot.

This e-mail and its attachments is intended only to be used as an e-mail
and an attachment. Any use of it for other purposes other than as an
e-mail and an attachment will not be covered by any warranty that may or
may not form part of this e-mail and attachment.