Re: Is Bayes Dead? Have the spammers won?

Marc Perkel Fri, 23 Mar 2007 14:44:43 -0800


Jim Maul wrote:

Marc Perkel wrote:
Jim Maul wrote:
Marc Perkel wrote:
Perhaps what I need to do is to get rid of autolearn and write my own learning system that strips out the body of messages with images and just learns the headers. My problem is that when users get image spam they put it in the spam folders and they get learned. But the text in the image spam causes ham type text to be learned as spam. That causes ham to get higher scores.
Are you sure of this? Have you also trained these ham messages to counter this effect? Not too long ago we were in the same situation. I have autolearn enabled but I have adjusted the thresholds to avoid learning false positives/negatives. We were getting ham (although arguably - they were newsletter type ham) that was hitting BAYES_99. As soon as I started training them as ham the problem went away. Spam is still detected correctly by bayes and these newsletters no longer hit BAYES_99.
-Jim
What I think my problem might be is that I have done so much work prescreening messages with Exim that what's left isn't good stock for autolearn. I think what I need is a separate dedicated learner server that is selective and smart about what it learns.
This is quite possible. I have heard other stories of people using things like greylisting and RBLs to reject so much at SMTP time that the only things that eventually made it to SA were too limited, and it would produce odd results for bayes. From my experience, the more you throw at bayes, the better it gets. The more selective you are, the less it has to work with.
Jim

Yes - I think that's what's happening to me. I also created an automatic whitelisting system that shaves off about 1/2 of ham, bypassing SA. What I need to do is fork off a copy of a lot of email that's bypassing SA and stuff it into the learner. Like I said originally, bayes used to be my best tool. I'd like to get that back.
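
For what it's worth, the "strip the body of image messages and learn only the headers" idea from earlier in the thread could be sketched roughly like this in Python. This is only an illustration, not anything SA ships: the function names has_image_part and headers_only are made up here, and dropping the Content-* headers (so the result is not a multipart shell with no parts) is my own assumption about what a learner would want.

```python
from email import message_from_string
from email.message import Message


def has_image_part(msg: Message) -> bool:
    """Return True if any MIME part of the message is an image."""
    return any(part.get_content_maintype() == "image" for part in msg.walk())


def headers_only(raw: str) -> str:
    """Return the message unchanged unless it carries an image part.

    For image messages, return just the header block so that the body
    text (often ham-like filler) is never shown to the learner.
    """
    msg = message_from_string(raw)
    if not has_image_part(msg):
        return raw
    kept = [
        f"{name}: {value}"
        for name, value in msg.items()
        # Assumption: MIME structure headers are dropped because the
        # parts they describe are no longer present.
        if not name.lower().startswith("content-")
    ]
    # A blank line terminates the header block; the body is left empty.
    return "\n".join(kept) + "\n\n"
```

The output of headers_only() would then be what gets handed to the learner (for example on sa-learn's standard input) instead of the full message. Messages without images pass through untouched, so ordinary spam and ham are still learned in full.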
