Re: Why shouldn't I set the score for SPAM_99 and SPAM_999 higher?

2022-05-05 Thread Thomas Cameron
On 5/5/22 14:28, Dave Wreski wrote: No, that's how you train your corpora. If you manually look through the headers of mail that's already been processed by your mail system, the ham should be as close to BAYES_00 as possible, and spam should be at BAYES_99. If that's not the case, then it's

Re: Why shouldn't I set the score for SPAM_99 and SPAM_999 higher?

2022-05-05 Thread Dave Wreski
That's a great call, thanks. I grepped my mail files and didn't find any SPAM_99 headers in any of them. You should be looking for BAYES_99 and BAYES_999 in your corpus. Thanks, Dave. I use my various mailboxes (sa-learn --ham --mbox /home/thomas.cameron/mail/INBOX/[mailbox file] and

Re: Why shouldn't I set the score for SPAM_99 and SPAM_999 higher?

2022-05-05 Thread Thomas Cameron
On 5/5/22 11:59, Dave Wreski wrote: You should probably check that none of your ham (i.e. non-spam) messages contains SPAM_99 or SPAM_999. It can happen when spammers poison your bayes database, and increased score in that case might lead to legitimate mail being misclassified as a spam.

Re: Why shouldn't I set the score for SPAM_99 and SPAM_999 higher?

2022-05-05 Thread Dave Wreski
You should probably check that none of your ham (i.e. non-spam) messages contains SPAM_99 or SPAM_999. It can happen when spammers poison your bayes database, and increased score in that case might lead to legitimate mail being misclassified as a spam. That's a great call, thanks. I grepped

Re: Why shouldn't I set the score for SPAM_99 and SPAM_999 higher?

2022-05-05 Thread Thomas Cameron
On 5/5/22 11:47, Matija Nalis wrote: On Thu, May 05, 2022 at 10:37:40AM -0500, Thomas Cameron wrote: I understand that turning knobs without understanding the consequences can do bad thing, but almost all of the spam that gets through SA on my server has SPAM_99 or SPAM_999 set in the headers.

Re: Why shouldn't I set the score for SPAM_99 and SPAM_999 higher?

2022-05-05 Thread Matija Nalis
You should probably check that none of your ham (i.e. non-spam) messages contains SPAM_99 or SPAM_999. It can happen when spammers poison your bayes database, and increased score in that case might lead to legitimate mail being misclassified as a spam. On Thu, May 05, 2022 at 10:37:40AM -0500,

Re: Why shouldn't I set the score for SPAM_99 and SPAM_999 higher?

2022-05-05 Thread Thomas Cameron
On 5/5/22 10:46, Reindl Harald wrote: Am 05.05.22 um 17:37 schrieb Thomas Cameron: I understand that turning knobs without understanding the consequences can do bad thing, but almost all of the spam that gets through SA on my server has SPAM_99 or SPAM_999 set in the headers. It is

Why shouldn't I set the score for SPAM_99 and SPAM_999 higher?

2022-05-05 Thread Thomas Cameron
I understand that turning knobs without understanding the consequences can do bad thing, but almost all of the spam that gets through SA on my server has SPAM_99 or SPAM_999 set in the headers. It is obviously spam, so I don't really get how it wasn't flagged, but it wasn't. What are the risks