On 2019-03-17 5:46 p.m., John Capo wrote:
On Sun, March 17, 2019 17:03, James wrote:
I've been getting a lot of spam so I'm thinking of lowering the
"required" number.


About 50 % spam gets a 4.4 so my required=4.5 is a tiny bit high.

I run an IMAP server with my own spamassassin.
I have over 60 folders I was running sa-learn --ham on.
I was also running sa-learn --ham on my .Sent folder.
I have 2 other folders, one of ham and one for missed spam.
I put messages manually in these folders.


What is the Bayes score for the missed spam?
This is from a missed spam.
How do I see the bayes score?

X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13)
X-Spam-Status: No, score=3.2 required=4.5 tests=BAYES_50,DKIM_SIGNED,
        DKIM_VALID,DKIM_VALID_AU,HTML_IMAGE_ONLY_32,HTML_MESSAGE,RDNS_NONE,
        SPF_HELO_PASS,T_REMOTE_IMAGE,URIBL_BLACK autolearn=no
        autolearn_force=no version=3.4.2


$ sudo sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0    4665448          0  non-token data: nspam
0.000          0   51031938          0  non-token data: nham
0.000          0     226073          0  non-token data: ntokens
0.000          0 1354356549          0  non-token data: oldest atime
0.000          0 1552875700          0  non-token data: newest atime
0.000 0 1552876023 0 non-token data: last journal sync atime
0.000          0 1552862792          0  non-token data: last expiry atime
0.000 0 2608811 0 non-token data: last expire atime delta 0.000 0 114913 0 non-token data: last expire reduction count

This is after not doing my inboxes and not my sent box.

# sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0        338          0  non-token data: nspam
0.000          0        121          0  non-token data: nham
0.000          0      73047          0  non-token data: ntokens
0.000          0 1354356549          0  non-token data: oldest atime
0.000          0 1552856726          0  non-token data: newest atime
0.000 0 0 0 non-token data: last journal sync atime
0.000          0          0          0  non-token data: last expiry atime
0.000 0 0 0 non-token data: last expire atime delta 0.000 0 0 0 non-token data: last expire reduction count

https://spamassassin.apache.org/full/3.1.x/doc/sa-learn.html
A decent number of messages are required before results are useful for ham/spam 
determination.
How many of each?


Reply via email to