Hi SA User List, Here's my case: postfix + amavisd-new + SpamAssassin 2.64 working on a Gentoo Linux box, serving as a mail server for serveral virtual domains.
Some SpamAssassin details: Bayes learning activated recently, based on about 300 spam mails and 200 ham mails, which accumulate in IMAP folders and are scanned using sa-learn via cron job three times a day. It all seems to work, but I see SA passing through some obvious spam, so I decided to look. And what I dicovered was very surprising for me: SA computes the score well, but suddenly lowers it significantly exactly before returning an answer to amavisd. Here an examples: . . debug: running raw-body-text per-line regexp tests; score so far=4.166 debug: running uri tests; score so far=4.166 debug: uri tests: Done uriRE debug: running full-text regexp tests; score so far=4.166 debug: all '*From' addrs: [EMAIL PROTECTED] debug: all '*To' addrs: [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED] ydamian [EMAIL PROTECTED] debug: forged-HELO: from=media-c.local helo=troyer.co.at by=media-c.de debug: forged-HELO: mismatch on HELO: 'troyer.co.at' != 'media-c.local' debug: forged-HELO: from=wanadoo.fr helo= by=troyer.co.at debug: forged-HELO: mismatch on from: 'media-c.local' != 'troyer.co.at' debug: running meta tests; score so far=5.53 debug: auto-learn? ham=0.2, spam=8, body-hits=4.166, head-hits=1.364 debug: auto-learn: currently using scoreset 2. recomputing score based on scoreset 0. debug: Score set 0 chosen. debug: auto-learn: original score: 5.53, recomputed score: 4.922 debug: Score set 2 chosen. debug: auto-learn? no: inside auto-learn thresholds debug: is spam? score=0.629 required=6.8 tests=BAYES_00,DATE_IN_PAST_12_24,SARE_ADULT2,SARE_OBFUPORNO Then the message is tagged "X-Spam-Status: No, hits=0.6". This is an obvious adult site adv. and SARE rules do a good job. But why the score is lowered at the end? If I use spamassassin command line tool for the SAME message, I get very good result: # spamassassin < 1102684670.M621242P31174V0000000000006210I00006396_0.prodo,S=1996:2,S X-Spam-Checker-Version: SpamAssassin 2.64 (2004-01-11) on prodo.media-c.local X-Spam-Level: ****** X-Spam-Status: No, hits=6.5 required=6.8 tests=BEST_PORN,DATE_IN_PAST_12_24, SARE_ADULT2,SARE_OBFUPORNO autolearn=no version=2.64 -- cut here ------------------------ As is not difficult to gues, I would like to have THESE scores, non-lowered! I suspected the bayesian learning to be blamed... but when checking the learning sesssions logs, everyhting is correct, spam and ham are perfectly sorted and learning is conducted as appropriate. So I am stuck. Why does this mailfunction appear? Suggestions for fixing? Any help would greatly appreciated. Yassen P.S. Please find below enclosed some configs as /etc/mail/spamassassin/local.cf, and others. -- Yassen Damyanov phone: +359-32-968-903 email: [EMAIL PROTECTED] ICQ# : 169382108 web : www.troyer-is.com # cat /etc/mail/spamassassin/local.cf | sed -n -e '/^[\t ]*#\|^[\t ]*$/!p' trusted_networks 127.0.0. required_hits 6.8 rewrite_subject 1 subject_tag [SPAM?] report_safe 1 use_terse_report 0 use_bayes 1 auto_learn 1 skip_rbl_checks 0 use_razor2 1 use_dcc 1 use_pyzor 1 ok_languages de en ok_locales de en # cat /etc/amavisd.conf | sed -n -e '/^[\t ]*#\|^[\t ]*$/!p' # -- Relevant Sections Only -- $MYHOME = '/var/run/amavis'; # (default is '/var/amavis') $spam_quarantine_to = new_RE( [qr'^([EMAIL PROTECTED])@([EMAIL PROTECTED])$'i => '/var/virtual/hosts/${2}/home/${1}/.maildir/.Junk/new'], [qr/.*/ => 'spam-quarantine'] ); $X_HEADER_TAG = 'X-Virus-Scanned'; $X_HEADER_LINE = "by amavisd-new-20030616:clamav-0.80 at prodo.media-c.de"; $undecipherable_subject_tag = '[UNCHECKED] '; $remove_existing_x_scanned_headers = 0; $remove_existing_spam_headers = 0; $sa_local_tests_only = 1; $sa_timeout = 30; $sa_mail_body_size_limit = 150*1024; $sa_tag_level_deflt = -99.0; $sa_tag2_level_deflt = 4.2; $sa_kill_level_deflt = 5.8; $sa_dsn_cutoff_level = 10.0; $sa_spam_subject_tag = '[SPAM?] '; $sa_debug = 1; When running sa-learning, I use