Re: sa-learn

alexus Tue, 21 Apr 2009 13:03:50 -0700

On Tue, Apr 21, 2009 at 3:58 PM, Gene Heskett <gene.hesk...@verizon.net> wrote:
> On Tuesday 21 April 2009, alexus wrote:
>>On Tue, Apr 21, 2009 at 1:21 AM, Gene Heskett <gene.hesk...@verizon.net> wrote:
>>> On Monday 20 April 2009, alexus wrote:
>>>>i'm trying to teach my SA what's spam
>>>>
>>>>it's a brand new, out-of-the-box SA. i have a few domains where i get
>>>>nothing but spam, and it seems to come from the same spammers, as
>>>>they "picked" addresses that they thought would be good to spam and keep
>>>>on spamming them
>>>>
>>>>so i do sa-learn --spam *
>>>>after a while it says something like
>>>>
>>>>Learned tokens from 52 message(s) (52 message(s) examined)
>>>>
>>>>yet when more of much the same email comes in, it still can't
>>>>determine whether it's spam or not...
>>>>
>>>>am i doing something wrong? or isn't sa-learn supposed to work the way i
>>>>thought it would?
>>>
>>> You need to have it learn at least 200 messages of both 'ham' and 'spam'
>>> before it has enough data to switch to working mode.  So sort them into
>>> separate directories, and have it learn both a clean inbox as ham, and an
>>> all spam directory.  When it has learned those, it keeps track and will not
>>> learn those particular emails again, so clean the spam box, just delete
>>> its contents.  I even use a cleaned up, sorted to separate directories
>>> mailing list as ham just so it knows stuff from that list is generally
>>> ham.  I had one list that I never figured out what was spammy about it,
>>> and since the corpus of that list went back several years, I fed the whole
>>> thing to SA as ham. Took it several hours but no more problems with that
>>> list's messages now.  Now, the spam that does get through goes into a spam
>>> dir, and a cron job learns it, then deletes it daily.  I'm lazy, and
>>> repetitive tasks are to be done by a cron fired script around this camp.
>>> :)
>>>
>>> --
>>> Cheers, Gene
>>> "There are four boxes to be used in defense of liberty:
>>>  soap, ballot, jury, and ammo. Please use in that order."
>>> -Ed Howdershelt (Author)
>>> Any two philosophers can tell each other all they know in two hours.
>>>                -- Oliver Wendell Holmes, Jr.
>>
>>how do I change my SA from learning mode to working mode?
>
> I believe that is automatic once it has enough data.  See above, 200 msgs of
> each type required IIRC.
>
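
To make the 200-message threshold concrete, here is a rough sketch of the check as I understand it. The two constants are the stock defaults for bayes_min_spam_num and bayes_min_ham_num; the function names are made up for illustration, the spam count of 52 is from my sa-learn run above, and the ham count of 0 is only an assumption. The real counts can be read from the nspam and nham lines of `sa-learn --dump magic`.

```python
# Stock defaults for bayes_min_spam_num / bayes_min_ham_num
# (assumed unchanged in local.cf).
MIN_SPAM = 200
MIN_HAM = 200


def bayes_ready(nspam: int, nham: int) -> bool:
    """True once both corpora have reached their minimum size."""
    return nspam >= MIN_SPAM and nham >= MIN_HAM


def still_needed(nspam: int, nham: int) -> dict:
    """How many more messages of each kind must be learned."""
    return {
        "spam": max(0, MIN_SPAM - nspam),
        "ham": max(0, MIN_HAM - nham),
    }


# 52 spam learned so far (from the sa-learn output), ham count assumed 0.
ready = bayes_ready(52, 0)
needed = still_needed(52, 0)
print(ready, needed)
```

With those numbers the check fails on both sides: 148 more spam and 200 ham would still be needed.
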
> Understand that SA only rates the email, and puts its findings in the header.
> It is up to you to determine what is done with mail that is too spammy.  I use
> procmail as the MDA from fetchmail, and procmail is configured to send
> anything that SA labels with 5 stars or over to /dev/null.
>
> --
> Cheers, Gene
> "There are four boxes to be used in defense of liberty:
>  soap, ballot, jury, and ammo. Please use in that order."
> -Ed Howdershelt (Author)
> Delta: The kids will love our inflatable slides.    -- David Letterman
>
>


here is an example:

Received: by simscan 1.4.0 ppid: 97779, pid: 97780, t: 3.8809s
        scanners: regex: 1.4.0 clamav: 0.95/m:50/d:9252 spam: 3.2.5
X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on mx1.alexus.biz
X-Spam-Level: ****
X-Spam-Status: No, score=4.9 required=5.0 tests=BAYES_99,HTML_MESSAGE,
        MIME_HTML_ONLY,SPF_HELO_PASS autolearn=no version=3.2.5

it hit BAYES_99, yet it still reports autolearn=no, and with score=4.9
against required=5.0 it still doesn't mark this as spam
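
To see how close that message came, here is a small sketch that pulls the X-Spam-Status value apart. The function name parse_spam_status is hypothetical and the parsing only assumes the layout shown in the header above; it is not anything SpamAssassin ships.

```python
import re


def parse_spam_status(value: str) -> dict:
    """Split an X-Spam-Status header value into its fields."""
    # Unfold continuation lines, then drop the whitespace that
    # folding leaves after commas inside the tests= list.
    flat = re.sub(r",\s+", ",", " ".join(value.split()))
    verdict, rest = flat.split(",", 1)
    fields = dict(item.split("=", 1) for item in rest.split())
    return {
        "is_spam": verdict.strip().lower() == "yes",
        "score": float(fields["score"]),
        "required": float(fields["required"]),
        "tests": [t for t in fields.get("tests", "").split(",") if t],
        "autolearn": fields.get("autolearn"),
    }


status = parse_spam_status(
    "No, score=4.9 required=5.0 tests=BAYES_99,HTML_MESSAGE,\n"
    "        MIME_HTML_ONLY,SPF_HELO_PASS autolearn=no version=3.2.5"
)
# Points still missing before the message would be marked as spam.
shortfall = round(status["required"] - status["score"], 2)
print(status["is_spam"], status["tests"], shortfall)
```

So the four tests together fall 0.1 short of the required 5.0, which is why the verdict is still "No" even with BAYES_99 in the list.
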

-- 
http://alexus.org/
