https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6155

--- Comment #141 from Mark Martinec <[email protected]> 2009-10-28 09:02:40 
UTC ---
>> But I agree that more may need re-fixing.
> 
> cool.
> In particular, some of the DNSBLs and most of the DNSWLs are good to 'lock
> down', I feel, as users tend to 'compensate' or correct their scores more
> frequently than other rules -- in my opinion.  Also, if those are given low
> scores by the GA, their operators tend to be annoyed, and it's not good to
> annoy people who we're relying on ;)
> 
> It also reflects that those rules are slightly different, and hopefully 
> more reliable, than a typical body rule for example -- there's no way to
> indicate this to the GA yet, so locking the rules is as good as we can do.

| It is quite possible that some of these hits are still false positives,
| despite several iterations of cleaning

I wonder how much is the low score for some ham rules affected by false
positives present in the spam* corpora. Here is some statistics for
the more prominent ham rules (i.e. the ones with negative scores).

For each rule the table shows a number of hits of this rule for each
corpus - both as a percentage of all entries in a file, and as absolute
counts. The entries standing out from the crowd that may need re-checking
are labeled with *** :

score ALL_TRUSTED -1.000
 0.046 %     1/2194 spam-bayes-net-bb-kmcgrail
 0.017 %    4/23761 spam-bayes-net-mmartinec
 0.014 %    5/36941 spam-bayes-net-hege
 0.001 %    1/81265 spam-bayes-net-bluestreak
 0.000 %   1/931863 spam-bayes-net-dos

score BAYES_00  0 0 -1.2 -1.9
 5.652 %   104/1840 spam-bayes-net-bb-jhardin  ***
 1.805 %  429/23761 spam-bayes-net-mmartinec
 1.606 %    33/2055 spam-bayes-net-ahenry
 0.439 %  357/81265 spam-bayes-net-bluestreak
 0.374 %  138/36941 spam-bayes-net-hege
 0.030 % 445/1489699 spam-bayes-net-jm
 0.017 % 156/931863 spam-bayes-net-dos

score DCC_REPUT_00_12  0 -0.8 0 -0.4
 0.164 %   39/23761 spam-bayes-net-mmartinec

score HABEAS_ACCREDITED_SOI 0 -1.634 0 -0.475
 5.382 %    76/1412 spam-bayes-net-bb-guenther_fraud  ***
 0.272 %     5/1840 spam-bayes-net-bb-jhardin
 0.091 %     2/2194 spam-bayes-net-bb-kmcgrail
 0.059 %   14/23761 spam-bayes-net-mmartinec
 0.049 %   18/36941 spam-bayes-net-hege
 0.037 % 558/1489699 spam-bayes-net-jm
 0.030 %     2/6728 spam-bayes-net-wt-en1
 0.018 %   15/81265 spam-bayes-net-bluestreak
 0.000 %   1/931863 spam-bayes-net-dos

score RCVD_IN_DNSWL_HI  0 -1.8 0 -1.8
 0.163 %     3/1840 spam-bayes-net-bb-jhardin  ***
 0.091 %     2/2194 spam-bayes-net-bb-kmcgrail
 0.071 %     1/1412 spam-bayes-net-bb-guenther_fraud
 0.003 %    1/36941 spam-bayes-net-hege
 0.000 %  1/1489699 spam-bayes-net-jm

score RCVD_IN_DNSWL_MED  0 -1.5 0 -1.2
 1.250 %    23/1840 spam-bayes-net-bb-jhardin  ***
(1.108 %      7/632 spam-bayes-net-binnocenti.OFF)
 0.638 %    14/2194 spam-bayes-net-bb-kmcgrail
 0.469 %  381/81265 spam-bayes-net-bluestreak
 0.438 %     9/2055 spam-bayes-net-ahenry
 0.223 %    15/6728 spam-bayes-net-wt-en1
 0.214 %   79/36941 spam-bayes-net-hege
 0.046 % 682/1489699 spam-bayes-net-jm
 0.042 %     3/7185 spam-bayes-net-bb-zmi
 0.013 %    3/23761 spam-bayes-net-mmartinec
 0.010 %    2/19160 spam-bayes-net-wt-en4
 0.003 %  29/931863 spam-bayes-net-dos

score RCVD_IN_DNSWL_LOW  0 -0.6 0 -1.1
 16.153 % 240627/1489699 spam-bayes-net-jm  ***
(9.810 %     62/632 spam-bayes-net-binnocenti.OFF)
 1.739 %    32/1840 spam-bayes-net-bb-jhardin
 1.600 %  591/36941 spam-bayes-net-hege
 1.159 %    78/6728 spam-bayes-net-wt-en1
 1.133 %    16/1412 spam-bayes-net-bb-guenther_fraud
 0.925 %    19/2055 spam-bayes-net-ahenry
 0.365 %     8/2194 spam-bayes-net-bb-kmcgrail
 0.107 %   87/81265 spam-bayes-net-bluestreak
 0.097 %     7/7185 spam-bayes-net-bb-zmi
 0.022 % 201/931863 spam-bayes-net-dos
 0.021 %    5/23761 spam-bayes-net-mmartinec
 0.016 %    3/19160 spam-bayes-net-wt-en4

score RCVD_IN_BSP_TRUSTED 0 -0.001 0 -0.001
 5.312 %    75/1412 spam-bayes-net-bb-guenther_fraud  ***
 0.030 %     2/6728 spam-bayes-net-wt-en1
 0.029 %    7/23761 spam-bayes-net-mmartinec
 0.029 % 435/1489699 spam-bayes-net-jm
 0.015 %   12/81265 spam-bayes-net-bluestreak
 0.003 %    1/36941 spam-bayes-net-hege
 0.001 %  11/931863 spam-bayes-net-dos

score RCVD_IN_IADB_DK 0 -0.044 0 -0.001
 0.059 %     4/6728 spam-bayes-net-wt-en1
 0.054 %     1/1840 spam-bayes-net-bb-jhardin
 0.033 %   27/81265 spam-bayes-net-bluestreak
 0.004 %    1/23761 spam-bayes-net-mmartinec
 0.001 % 21/1489699 spam-bayes-net-jm

score RCVD_IN_IADB_RDNS 0 -0.018 0 -0.001
 0.342 %    23/6728 spam-bayes-net-wt-en1  ***
 0.054 %     1/1840 spam-bayes-net-bb-jhardin
 0.049 %     1/2055 spam-bayes-net-ahenry
 0.033 %   27/81265 spam-bayes-net-bluestreak
 0.004 %    1/23761 spam-bayes-net-mmartinec
 0.002 % 26/1489699 spam-bayes-net-jm

score RCVD_IN_IADB_OPTIN 0 -3.265 0 -2.791
 0.342 %    23/6728 spam-bayes-net-wt-en1  ***
 0.049 %     1/2055 spam-bayes-net-ahenry
 0.000 %  4/1489699 spam-bayes-net-jm

score RCVD_IN_IADB_OPTIN_GT50 0 -0.219 0 -1.041
 0.054 %     1/1840 spam-bayes-net-bb-jhardin

score RCVD_IN_IADB_DOPTIN 0
 0.000 %  7/1489699 spam-bayes-net-jm

score RCVD_IN_IADB_DOPTIN_LT50 0 -0.001 0 -0.001
 0.026 %   21/81265 spam-bayes-net-bluestreak  ***
 0.001 % 15/1489699 spam-bayes-net-jm.log

score RCVD_IN_IADB_DOPTIN_GT50 0
 0.007 %    6/81265 spam-bayes-net-bluestreak
 0.004 %    1/23761 spam-bayes-net-mmartinec

score RCVD_IN_IADB_ML_DOPTIN 0
 0.000 %  2/1489699 spam-bayes-net-jm

score RCVD_IN_IADB_UT_CPR_MAT 0 -0.001 0 -0.052
 0.026 %   21/81265 spam-bayes-net-bluestreak  ***
 0.001 % 15/1489699 spam-bayes-net-jm

score RCVD_IN_IADB_MI_CPR_MAT 0 -0.079 0 -0.001
 0.026 %   21/81265 spam-bayes-net-bluestreak  ***
 0.001 % 15/1489699 spam-bayes-net-jm

score RCVD_IN_IADB_LISTED 0 -1.144 0 -0.001
 0.342 %    23/6728 spam-bayes-net-wt-en1  ***
 0.054 %     1/1840 spam-bayes-net-bb-jhardin
 0.049 %     1/2055 spam-bayes-net-ahenry
 0.033 %   27/81265 spam-bayes-net-bluestreak
 0.004 %    1/23761 spam-bayes-net-mmartinec
 0.002 % 26/1489699 spam-bayes-net-jm
 0.000 %   1/931863 spam-bayes-net-dos

score RCVD_IN_IADB_SENDERID 0 -0.001 0 -0.001
 0.208 %    14/6728 spam-bayes-net-wt-en1  ***
 0.049 %     1/2055 spam-bayes-net-ahenry
 0.033 %   27/81265 spam-bayes-net-bluestreak
 0.004 %    1/23761 spam-bayes-net-mmartinec
 0.000 %  4/1489699 spam-bayes-net-jm

score RCVD_IN_IADB_SPF 0 -0.006 0 -0.042
 0.342 %    23/6728 spam-bayes-net-wt-en1  ***
 0.054 %     1/1840 spam-bayes-net-bb-jhardin
 0.049 %     1/2055 spam-bayes-net-ahenry
 0.033 %   27/81265 spam-bayes-net-bluestreak
 0.004 %    1/23761 spam-bayes-net-mmartinec
 0.002 % 26/1489699 spam-bayes-net-jm

score RCVD_IN_IADB_VOUCHED 0 -1.718 0 -0.956
 0

-- 
Configure bugmail: 
https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

Reply via email to