On Tue, 2004-03-16 at 23:14, [EMAIL PROTECTED] wrote: > > TB> Hey cool, done that now. Just looked at the headers of a message > > TB> received which says "autolearn=ham" This was a message from the SA > > TB> group funnily enough - presumably that is correct? > > > > Unless that message included spam samples, then no problem. > > > > I suggest you set your non-spam auto-learn threshold to -0.01 to make > > sure that spam that hits no rules is not accidentally learned as ham. > > errr...what? who? where? how??? > Also, loooking at more headers - if they say "autolearn=no", does that > just mean SA had no idea if it was spam or ham, or does it just mean that > autolearn is off and I Was looking at an old message? ;)else? > > > My understanding is that each domain with a $HOME will have one > > $HOME/.spamassassin directory, and the bayes database built there will > > apply to all [EMAIL PROTECTED] for that domain. > > Cool. That does indeed seem to be the case - my mailboxes were > refreshingly free of spam this morning - hurrah! > > > cp /dev/null $file > > or > > cat </dev/null >$file > > are two methods I've used to empty files. > > okay will do that - is there any advantage of one over the other apart > from less typing? ;) >
A better way to zero out a file is by using '>' notation which will not
destroy any file descriptors attached to the file aside from being
extremely easy and short.
for f in $(find . -name "whatever")
do
> $f
more processing on $f
done
> > TB> - is my first ever shell script!:
> [..]
> > TB> Any obvious flaws there guys, or something I could do better? It
> >
> > Looks good to me. I wouldn't cat them all into one file first, since my
> > understanding is that the shorter/quicker sa-learn runs are better (less
> > chance they'll block bayes update by incoming email and auto-learn).
>
> okay, thanks m8. You cat them in your script though don't you?
>
> > TB> If the former, then presumably my script would be better off
> > contatenating
> > TB> the spam and ham files before passing them to a single run of
> > sa-learn?
> >
> > I run my scripts once an hour.
>
> Blimey - do you get THAT much spam? ;)
>
> > You need 200+ spams and 200+ hams before Bayes takes effect and starts
> > applying its scores to your emails. It then remains effective unless you
> > drop below those numbers (such as by deleting the database files and
> > starting over). That has nothing to do with sa-learn. The more often
> > sa-learn runs, the more current your bayes database is.
>
> Okay, thanks.
> For ham, do you just copy everything from your inbox (apart from spam not
> caught) or is there stuff you WOULDN'T put through the spam filter? eg,
> all the posts to this list?
> I am on a number of lists and the volume would make perfect ham material,
> but I'd be worried sometimes that the content wouldn't and certain
> characteristics - eg being sent to a large no. of people, me not being
> explicitly set as a recipient.
> For spam, is there any value in passing already identified spam (sent to
> the spambox thru sa-learn?
>
> thanks again,
>
> Tony
--
Jim Conner <[EMAIL PROTECTED]>
signature.asc
Description: This is a digitally signed message part
