> TB> Hey cool, done that now.  Just looked at the headers of a message
> TB> received which says "autolearn=ham" This was a message from the SA
> TB> group funnily enough - presumably that is correct?
>
> Unless that message included spam samples, then no problem.
>
> I suggest you set your non-spam auto-learn threshold to -0.01 to make
> sure that spam that hits no rules is not accidentally learned as ham.

errr...what?  who? where? how???
Also, loooking at more headers - if they say "autolearn=no", does that
just mean SA had no idea if it was spam or ham, or does it just mean that
autolearn is off and I Was looking at an old message? ;)else?

> My understanding is that each domain with a $HOME will have one
> $HOME/.spamassassin directory, and the bayes database built there will
> apply to all [EMAIL PROTECTED] for that domain.

Cool.  That does indeed seem to be the case - my mailboxes were
refreshingly free of spam this morning - hurrah!

> cp /dev/null $file
> or
> cat </dev/null >$file
> are two methods I've used to empty files.

okay will do that - is there any advantage of one over the other apart
from less typing? ;)

> TB> - is my first ever shell script!:
[..]
> TB> Any obvious flaws there guys, or something I could do better?   It
>
> Looks good to me.  I wouldn't cat them all into one file first, since my
> understanding is that the shorter/quicker sa-learn runs are better (less
> chance they'll block bayes update by incoming email and auto-learn).

okay, thanks m8.  You cat them in your script though don't you?

> TB> If the former, then presumably my script would be better off
> contatenating
> TB> the spam and ham files before passing them to a single run of
> sa-learn?
>
> I run my scripts once an hour.

Blimey - do you get THAT much spam? ;)

> You need 200+ spams and 200+ hams before Bayes takes effect and starts
> applying its scores to your emails. It then remains effective unless you
> drop below those numbers (such as by deleting the database files and
> starting over). That has nothing to do with sa-learn. The more often
> sa-learn runs, the more current your bayes database is.

Okay, thanks.
For ham, do you just copy everything from your inbox (apart from spam not
caught) or is there stuff you WOULDN'T put through the spam filter? eg,
all the posts to this list?
I am on a number of lists and the volume would make perfect ham material,
but I'd be worried sometimes that the content wouldn't and certain
characteristics - eg being sent to a large no. of people, me not being
explicitly set as a recipient.
For spam, is there any value in passing already identified spam (sent to
the spambox thru sa-learn?

thanks again,

Tony

Reply via email to