On 5/5/22 14:28, Dave Wreski wrote:
No, that's how you train your corpora. If you manually look through
the headers of mail that's already been processed by your mail system,
the ham should be as close to BAYES_00 as possible, and spam should be
at BAYES_99. If that's not the case, then it's
That's a great call, thanks. I grepped my mail files and didn't find
any SPAM_99 headers in any of them.
You should be looking for BAYES_99 and BAYES_999 in your corpus.
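
To make that lookup concrete, here is a minimal sketch that pulls the Bayes rule names out of an X-Spam-Status header value. It assumes the header carries SpamAssassin's usual tests= list; the helper names bayes_rules and is_high_bayes, and the sample header, are made up for illustration.

```python
import re

# Matches SpamAssassin Bayes rule names such as BAYES_00, BAYES_50,
# BAYES_99 and BAYES_999.
BAYES_RULE = re.compile(r"\bBAYES_\d{2,3}\b")


def bayes_rules(status_header):
    """Return the BAYES_* rule names found in an X-Spam-Status value."""
    return BAYES_RULE.findall(status_header)


def is_high_bayes(status_header):
    """True if the header lists BAYES_99 or BAYES_999."""
    return any(rule in ("BAYES_99", "BAYES_999")
               for rule in bayes_rules(status_header))


# Hypothetical header value, for illustration only.
sample = ("No, score=3.5 required=5.0 "
          "tests=BAYES_99,BAYES_999,HTML_MESSAGE autolearn=no")
print(bayes_rules(sample), is_high_bayes(sample))
```

Note that a search for SPAM_99 finds nothing here, because the rule names start with BAYES_.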
Thanks, Dave. I use my various mailboxes (sa-learn --ham --mbox
/home/thomas.cameron/mail/INBOX/[mailbox file] and
On 5/5/22 11:59, Dave Wreski wrote:
You should probably check that none of your ham (i.e. non-spam)
messages contains SPAM_99 or SPAM_999. It can happen when spammers
poison your Bayes database, and the increased score in that case might
lead to legitimate mail being misclassified as spam.
On 5/5/22 11:47, Matija Nalis wrote:
On Thu, May 05, 2022 at 10:37:40AM -0500, Thomas Cameron wrote:
I understand that turning knobs without understanding the consequences can
do bad things, but almost all of the spam that gets through SA on my server
has SPAM_99 or SPAM_999 set in the headers.
You should probably check that none of your ham (i.e. non-spam)
messages contains SPAM_99 or SPAM_999. It can happen when spammers
poison your Bayes database, and the increased score in that case might
lead to legitimate mail being misclassified as spam.
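
A rough way to run the check described above is to tally which BAYES_* rule each message in a ham mailbox hit. The sketch below assumes the mail is stored in mbox format and that each processed message carries SpamAssassin's X-Spam-Status header with its tests= list. The function name tally_bayes is made up for illustration.

```python
import mailbox
import re
from collections import Counter

# Matches SpamAssassin Bayes rule names such as BAYES_00 or BAYES_999.
BAYES_RULE = re.compile(r"\bBAYES_\d{2,3}\b")


def tally_bayes(mbox_path):
    """Count how many messages in an mbox file hit each BAYES_* rule."""
    counts = Counter()
    # create=False: raise an error instead of silently creating an
    # empty mailbox when the path is wrong.
    box = mailbox.mbox(mbox_path, create=False)
    try:
        for message in box:
            status = str(message.get("X-Spam-Status", ""))
            rules = set(BAYES_RULE.findall(status))
            # Messages without any Bayes rule get their own bucket.
            counts.update(rules or {"(no BAYES rule)"})
    finally:
        box.close()
    return counts
```

Running tally_bayes on a ham mailbox should show most messages at or near BAYES_00. A noticeable count under BAYES_99 or BAYES_999 there would be worth investigating before raising the scores for those rules.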
On 5/5/22 10:46, Reindl Harald wrote:
Am 05.05.22 um 17:37 schrieb Thomas Cameron:
I understand that turning knobs without understanding the consequences
can do bad things, but almost all of the spam that gets through SA on my
server has SPAM_99 or SPAM_999 set in the headers. It is obviously spam,
so I don't really get how it wasn't flagged, but it wasn't. What are the
risks