Re: Is Bayes Dead? Have the spammers won?

Kris Deugau Thu, 22 Mar 2007 10:19:27 -0800

John D. Hardin wrote:

I've never trusted automatic learning. Why let your Bayes database be(even partially) under the control of a third party, particularlywhen that third party is the attacker?

Because there's no other (practical and/or ethical) way of gettingenough ham to make it useful?

Anyone using SA in an ISP environment will run into this problem; aboutthe only way I can see to legitimately get any real volume of ham is tosend customers' outbound mail into a learning queue somewhere. Eventhat has its limits and issues - for instance, the fact that any ISPlarger than a few thousand customers will likely have completelyseparate paths for inbound and outbound mail, which *will* affect theusefulness of the learning. :/

I've been running the same Bayes databases on one system and my personalemail since I upgraded from SA2.44 to 2.54 and started using Bayes; I'dbe running the original Bayes DB on another system if I had figured outI *could* just continue to use the exact same files upgrading2.64->3.1.7 at the time.

Accuracy on the continuous-use databases hasn't suffered for theautolearning, so far as I can tell... but the more out-of-date SAitself got the worse it was at tagging spam.

I *do* regularly feed back both my own missed-spams (my account, andthree role accounts), as well as customer-submitted missed-spam. Latelythere have only been four or five (reported) FNs per day, across thewhole system.


-kgd

Re: Is Bayes Dead? Have the spammers won?

Reply via email to