On Tue, Apr 21, 2009 at 3:58 PM, Gene Heskett <gene.hesk...@verizon.net> wrote: > On Tuesday 21 April 2009, alexus wrote: >>On Tue, Apr 21, 2009 at 1:21 AM, Gene Heskett <gene.hesk...@verizon.net> > wrote: >>> On Monday 20 April 2009, alexus wrote: >>>>i'm trying to teach my SA whats spam >>>> >>>>it's a brand new out of box SA, i have few domains that i dont get >>>>anything but a spam and on the top seems like from same spamers as >>>>they "picked" emails that they thought would be good to spam and keep >>>>on spaming them >>>> >>>>so i do sa-learn --spam * >>>>after a while it saying something like >>>> >>>>Learned tokens from 52 message(s) (52 message(s) examined) >>>> >>>>yet, when more of some what same email comes in it still can't >>>>determinate if its spam or not... >>>> >>>>am i doing something wrong? or is sa-learn isn't suppose to work as i >>>>thought it would.. >>> >>> You need to have it learn at least 200 messages of both 'ham' and 'spam' >>> before it has enough data to switch to working mode. So sort them into >>> separate directories, and have it learn both a clean inbox as ham, and an >>> all spam directory. When it has learned those, it keep track and will not >>> learn those particular emails again, so clean the spam box, just delete >>> its contents. I even use a cleaned up, sorted to separate directories >>> mailing list as ham just so it knows stuff from that list is generally >>> ham. I had one list that I never figured out what was spammy about it, >>> and since the corpus of that list went back several years, I fed the whole >>> thing to SA as ham. Took it several hours but no more problems with that >>> lists messages now. Now, the spam that does get through goes into a spam >>> dir, and a cron job learns it, then deletes it daily. I'm lazy, and >>> repetitive tasks are to be done by a cron fired script around this camp. >>> :) >>> >>> -- >>> Cheers, Gene >>> "There are four boxes to be used in defense of liberty: >>> soap, ballot, jury, and ammo. Please use in that order." >>> -Ed Howdershelt (Author) >>> Any two philosophers can tell each other all they know in two hours. >>> -- Oliver Wendell Holmes, Jr. >> >>how do I change my SA from learning mode to working mode? > > I believe that is automatic once it has enough data. See above, 200 msgs of > each type required IIRC. > > Understand that SA only rates the email, and puts its findings in the header. > It is up to you to determine what is done with mail that is too spammy. I use > procmail as the MTA from fetchmail, and procmail is configured to send > anything that SA labels with 5 stars or over to /dev/null. > > -- > Cheers, Gene > "There are four boxes to be used in defense of liberty: > soap, ballot, jury, and ammo. Please use in that order." > -Ed Howdershelt (Author) > Delta: The kids will love our inflatable slides. -- David Letterman > >
an example Received: by simscan 1.4.0 ppid: 97779, pid: 97780, t: 3.8809s scanners: regex: 1.4.0 clamav: 0.95/m:50/d:9252 spam: 3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on mx1.alexus.biz X-Spam-Level: **** X-Spam-Status: No, score=4.9 required=5.0 tests=BAYES_99,HTML_MESSAGE, MIME_HTML_ONLY,SPF_HELO_PASS autolearn=no version=3.2.5 it gave BAYES_99, yet it still think it's autolearn=no, and it still doesnt think this is SPAM -- http://alexus.org/