Bayes filter marking everything as ham

Peter Carlson Tue, 31 May 2016 14:59:13 -0700

(Sorry if this is a repost. I don't see my messages coming through... the irony of spamassassin.apache.org trapping my request for help as spam. I have snipped the logfile entries which I think were causing it to be tagged as spam.)

All of my messages are being tagged with BAYES_00=-1.9.
I have cleared the Bayes db (sa-learn --clear) and then trained it manually. Here are the results:

sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0        642          0  non-token data: nspam
0.000          0       9415          0  non-token data: nham
0.000          0     119685          0  non-token data: ntokens
0.000          0 1461963062          0  non-token data: oldest atime
0.000          0 1464701914          0  non-token data: newest atime
0.000          0          0          0  non-token data: last journal sync atime
0.000          0 1464701937          0  non-token data: last expiry atime
0.000          0    2764800          0  non-token data: last expire atime delta
0.000          0     455262          0  non-token data: last expire reduction count
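
For anyone who wants to compare the two message counts quickly, a small shell helper along these lines could pull nspam and nham out of that output. This is only a sketch: bayes_counts is a made-up name, not an sa-learn feature, and it assumes the column layout shown in the dump (count in the third field, label in the last).

```shell
# Hypothetical helper: read `sa-learn --dump magic` output on stdin and
# print the learned spam and ham message counts on one line.
bayes_counts() {
    awk '
        $NF == "nspam" { spam = $3 }   # third column holds the count
        $NF == "nham"  { ham  = $3 }
        END { printf "nspam=%d nham=%d\n", spam, ham }
    '
}
```

It would be used as: sa-learn --dump magic | bayes_counts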

Here are two log entries showing that the Bayes filter is very confident these emails are ham:

May 30 09:04:53 www amavis[16577]: (16577-03) Passed CLEAN {RelayedInbound}, <snip> Tests: [BAYES_00=-1.9,RCVD_IN_MSPIKE_H2=-0.001,SPF_PASS=-0.001,URIBL_BLOCKED=0.001], autolearn=ham autolearn_force=no, autolearnscore=-0.001, 3992 ms

May 30 08:34:13 www amavis[16252]: (16252-01) Passed SPAMMY {RelayedTaggedInbound}, <snip> Tests: [BAYES_00=-1.9,HTML_MESSAGE=0.001,HTML_TAG_BALANCE_BODY=1.157,MIME_HTML_MOSTLY=0.428,MPART_ALT_DIFF=0.79,RAZOR2_CHECK=0.922,SPF_FAIL=0.001,SPF_HELO_FAIL=0.001,THIS_AD=1.675,T_HTML_TAG_BALANCE_CENTER=0.01,URIBL_BLOCKED=0.001,URIBL_DBL_SPAM=2.5], autolearn=no autolearn_force=no, autolearnscore=8.272, 4054 ms

The spam is learned by a simple bash script. The users (my family) move spam into a SPAM folder. This script then runs every night (I have removed some of the logging lines and comments for brevity):

#!/bin/bash
# delete messages this old
cleanafter=14
# grab all the user folders
users=`find /var/spool/cyrus/mail -name SPAM -print`
for u in ${users[@]}; do
    inbox=${u%/*}
    folder=${u##*/}
    user=${inbox##*/}
    sa-learn --nosync --spam --progress --dir $inbox/SPAM
    sa-learn --nosync --ham --progress --dir $inbox
done
# sync the sa db
sa-learn --sync

Setup:
ubuntu server 14.04
postfix:2.11.0
amavis:2.7.1
spamassassin:3.4.0

postfix config (main.cf):

content_filter = smtp-amavis:[127.0.0.1]:10024
smtpd_recipient_restrictions = permit_sasl_authenticated, permit_mynetworks, reject_unauth_destination, reject_rbl_client zen.spamhaus.org, reject_rbl_client bl.spamcop.net reject_rbl_client ix.dnsbl.manitu.net, reject_rbl_client cbl.abuseat.org, reject_rbl_client b.barracudacentral.org, reject_rbl_client new.spam.dnsbl.sorbs.net
smtpd_client_restrictions = permit_sasl_authenticated, permit_mynetworks, reject_rbl_client zen.spamhaus.org, reject_rbl_client bl.spamcop.net reject_rbl_client ix.dnsbl.manitu.net, reject_rbl_client cbl.abuseat.org, reject_rbl_client b.barracudacentral.org, reject_rbl_client new.spam.dnsbl.sorbs.net

spamassassin config:

rewrite_header Subject *****PC SPAM*****
trusted_networks 192.168.
required_score 5.0
use_bayes 1
bayes_auto_learn 0
# bayes_ignore_header X-Bogosity
# bayes_ignore_header X-Spam-Flag
# bayes_ignore_header X-Spam-Status
ifplugin Mail::SpamAssassin::Plugin::Shortcircuit
# shortcircuit USER_IN_WHITELIST on
# shortcircuit USER_IN_DEF_WHITELIST on
# shortcircuit USER_IN_ALL_SPAM_TO on
# shortcircuit SUBJECT_IN_WHITELIST on
# shortcircuit USER_IN_BLACKLIST on
# shortcircuit USER_IN_BLACKLIST_TO on
# shortcircuit SUBJECT_IN_BLACKLIST on
# shortcircuit ALL_TRUSTED on
# shortcircuit BAYES_99 spam
# shortcircuit BAYES_00 ham
