Re: Re[4]: [N00b alert]SA in a cpanel environment

Jim Conner 17 Mar 2004 14:46:20 -0000

On Tue, 2004-03-16 at 23:14, [EMAIL PROTECTED] wrote:
> > TB> Hey cool, done that now.  Just looked at the headers of a message
> > TB> received which says "autolearn=ham" This was a message from the SA
> > TB> group funnily enough - presumably that is correct?
> >
> > Unless that message included spam samples, then no problem.
> >
> > I suggest you set your non-spam auto-learn threshold to -0.01 to make
> > sure that spam that hits no rules is not accidentally learned as ham.
> 
> errr...what?  who? where? how???
> Also, loooking at more headers - if they say "autolearn=no", does that
> just mean SA had no idea if it was spam or ham, or does it just mean that
> autolearn is off and I Was looking at an old message? ;)else?
> 
> > My understanding is that each domain with a $HOME will have one
> > $HOME/.spamassassin directory, and the bayes database built there will
> > apply to all [EMAIL PROTECTED] for that domain.
> 
> Cool.  That does indeed seem to be the case - my mailboxes were
> refreshingly free of spam this morning - hurrah!
> 
> > cp /dev/null $file
> > or
> > cat </dev/null >$file
> > are two methods I've used to empty files.
> 
> okay will do that - is there any advantage of one over the other apart
> from less typing? ;)
>


A better way to zero out a file is by using '>' notation which will not
destroy any file descriptors attached to the file aside from being
extremely easy and short.

for f in $(find . -name "whatever")
do
    > $f
    more processing on $f
done

> > TB> - is my first ever shell script!:
> [..]
> > TB> Any obvious flaws there guys, or something I could do better?   It
> >
> > Looks good to me.  I wouldn't cat them all into one file first, since my
> > understanding is that the shorter/quicker sa-learn runs are better (less
> > chance they'll block bayes update by incoming email and auto-learn).
> 
> okay, thanks m8.  You cat them in your script though don't you?
> 
> > TB> If the former, then presumably my script would be better off
> > contatenating
> > TB> the spam and ham files before passing them to a single run of
> > sa-learn?
> >
> > I run my scripts once an hour.
> 
> Blimey - do you get THAT much spam? ;)
> 
> > You need 200+ spams and 200+ hams before Bayes takes effect and starts
> > applying its scores to your emails. It then remains effective unless you
> > drop below those numbers (such as by deleting the database files and
> > starting over). That has nothing to do with sa-learn. The more often
> > sa-learn runs, the more current your bayes database is.
> 
> Okay, thanks.
> For ham, do you just copy everything from your inbox (apart from spam not
> caught) or is there stuff you WOULDN'T put through the spam filter? eg,
> all the posts to this list?
> I am on a number of lists and the volume would make perfect ham material,
> but I'd be worried sometimes that the content wouldn't and certain
> characteristics - eg being sent to a large no. of people, me not being
> explicitly set as a recipient.
> For spam, is there any value in passing already identified spam (sent to
> the spambox thru sa-learn?
> 
> thanks again,
> 
> Tony
-- 
Jim Conner <[EMAIL PROTECTED]>

signature.asc
Description: This is a digitally signed message part

Re: Re[4]: [N00b alert]SA in a cpanel environment

Reply via email to