On Feb 21, 2007, at 3:02 PM, Kris Deugau wrote:
> (at least, once Bayes was part of SA <g>) feed missed spam back
> into Bayes manually to complement the autolearning (which worked
> pretty well for me, and without which I'd have very VERY little ham
> learned at all).
I spent about a year training a good bayes corpus on one account, and
leaving bayes disabled on two others. The difference in spam caught
was a fraction of a percent, and when spammers started including
technical mailing-list chatter in their bayes-busting e-mails I
started having lots of false positives on the bayes-enabled account.
It simply doesn't pay off.
> Most third-party rules are scored to get spam over that threshold
> of 5 largely because, IME, most people seem to be quite happy to
> leave it at 5; if you're running a lower score, you WILL see FPs
> unless you *drop* the scores on some of the heavier rules. I
> probably saw at one point; what scores are these FPs getting on
> your system?
7-9. The reason I run at 3.8 is that I have 0 - none, null, void -
FPs between 3.8 and 5.0. The very few FPs I see are SPF failures,
which I score fairly harshly, and those start at 7.0.
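For anyone wanting to try something similar, a minimal local.cf sketch along these lines (the exact scores are assumptions for illustration, not my actual config; SPF_FAIL is the stock SpamAssassin rule name for a hard SPF failure):

```
# local.cf -- illustrative only
# Lower the tag threshold from the default of 5.0
required_score 3.8

# Score hard SPF failures harshly (assumed value)
score SPF_FAIL 7.0
```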
I used to have low FPs on code segments until I relegated the
chickenpox rulesets to 0.1 each. In fact, I plan to run a ruletest
because I never see chickenpox on real spam, so I'm pretty sure those
rules are useless now.
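Knocking a third-party ruleset down to near-nothing is just more score lines in local.cf; a sketch of the idea, with the rule names assumed as examples rather than copied from the actual chickenpox.cf:

```
# Defang the chickenpox rules rather than removing the ruleset;
# rule names below are assumed examples, not an exhaustive list
score CHICKENPOX_1_2 0.1
score CHICKENPOX_2_3 0.1
score CHICKENPOX_3_4 0.1
```

Setting a score to 0 outright would disable the rule entirely; 0.1 keeps it visible in the hit list so you can still see whether it ever fires on real spam.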
> I've had ONE customer that I ended up dropping the threshold to
> 4.8, because they kept getting spam that was *just* under 5. (I
> think I bumped it back up to 4.9 because of FPs. *sigh*) IIRC
> they're also the only customer that regularly seems to get pornspam
> (tagged or otherwise).
I can't imagine running at 4.8. A quick check confirms that more
than 600 spam messages would have hit this mailbox today. And that's
just this one mailbox, never mind hostmaster/webmaster/etc., which
get nailed even harder.
I don't have that kind of time.
--
Jo Rhett
Net Consonance : consonant endings by net philanthropy, open source
and other randomness