https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6155
--- Comment #141 from Mark Martinec <[email protected]> 2009-10-28 09:02:40 UTC --- >> But I agree that more may need re-fixing. > > cool. > In particular, some of the DNSBLs and most of the DNSWLs are good to 'lock > down', I feel, as users tend to 'compensate' or correct their scores more > frequently than other rules -- in my opinion. Also, if those are given low > scores by the GA, their operators tend to be annoyed, and it's not good to > annoy people who we're relying on ;) > > It also reflects that those rules are slightly different, and hopefully > more reliable, than a typical body rule for example -- there's no way to > indicate this to the GA yet, so locking the rules is as good as we can do. | It is quite possible that some of these hits are still false positives, | despite several iterations of cleaning I wonder how much is the low score for some ham rules affected by false positives present in the spam* corpora. Here is some statistics for the more prominent ham rules (i.e. the ones with negative scores). For each rule the table shows a number of hits of this rule for each corpus - both as a percentage of all entries in a file, and as absolute counts. The entries standing out from the crowd that may need re-checking are labeled with *** : score ALL_TRUSTED -1.000 0.046 % 1/2194 spam-bayes-net-bb-kmcgrail 0.017 % 4/23761 spam-bayes-net-mmartinec 0.014 % 5/36941 spam-bayes-net-hege 0.001 % 1/81265 spam-bayes-net-bluestreak 0.000 % 1/931863 spam-bayes-net-dos score BAYES_00 0 0 -1.2 -1.9 5.652 % 104/1840 spam-bayes-net-bb-jhardin *** 1.805 % 429/23761 spam-bayes-net-mmartinec 1.606 % 33/2055 spam-bayes-net-ahenry 0.439 % 357/81265 spam-bayes-net-bluestreak 0.374 % 138/36941 spam-bayes-net-hege 0.030 % 445/1489699 spam-bayes-net-jm 0.017 % 156/931863 spam-bayes-net-dos score DCC_REPUT_00_12 0 -0.8 0 -0.4 0.164 % 39/23761 spam-bayes-net-mmartinec score HABEAS_ACCREDITED_SOI 0 -1.634 0 -0.475 5.382 % 76/1412 spam-bayes-net-bb-guenther_fraud *** 0.272 % 5/1840 spam-bayes-net-bb-jhardin 0.091 % 2/2194 spam-bayes-net-bb-kmcgrail 0.059 % 14/23761 spam-bayes-net-mmartinec 0.049 % 18/36941 spam-bayes-net-hege 0.037 % 558/1489699 spam-bayes-net-jm 0.030 % 2/6728 spam-bayes-net-wt-en1 0.018 % 15/81265 spam-bayes-net-bluestreak 0.000 % 1/931863 spam-bayes-net-dos score RCVD_IN_DNSWL_HI 0 -1.8 0 -1.8 0.163 % 3/1840 spam-bayes-net-bb-jhardin *** 0.091 % 2/2194 spam-bayes-net-bb-kmcgrail 0.071 % 1/1412 spam-bayes-net-bb-guenther_fraud 0.003 % 1/36941 spam-bayes-net-hege 0.000 % 1/1489699 spam-bayes-net-jm score RCVD_IN_DNSWL_MED 0 -1.5 0 -1.2 1.250 % 23/1840 spam-bayes-net-bb-jhardin *** (1.108 % 7/632 spam-bayes-net-binnocenti.OFF) 0.638 % 14/2194 spam-bayes-net-bb-kmcgrail 0.469 % 381/81265 spam-bayes-net-bluestreak 0.438 % 9/2055 spam-bayes-net-ahenry 0.223 % 15/6728 spam-bayes-net-wt-en1 0.214 % 79/36941 spam-bayes-net-hege 0.046 % 682/1489699 spam-bayes-net-jm 0.042 % 3/7185 spam-bayes-net-bb-zmi 0.013 % 3/23761 spam-bayes-net-mmartinec 0.010 % 2/19160 spam-bayes-net-wt-en4 0.003 % 29/931863 spam-bayes-net-dos score RCVD_IN_DNSWL_LOW 0 -0.6 0 -1.1 16.153 % 240627/1489699 spam-bayes-net-jm *** (9.810 % 62/632 spam-bayes-net-binnocenti.OFF) 1.739 % 32/1840 spam-bayes-net-bb-jhardin 1.600 % 591/36941 spam-bayes-net-hege 1.159 % 78/6728 spam-bayes-net-wt-en1 1.133 % 16/1412 spam-bayes-net-bb-guenther_fraud 0.925 % 19/2055 spam-bayes-net-ahenry 0.365 % 8/2194 spam-bayes-net-bb-kmcgrail 0.107 % 87/81265 spam-bayes-net-bluestreak 0.097 % 7/7185 spam-bayes-net-bb-zmi 0.022 % 201/931863 spam-bayes-net-dos 0.021 % 5/23761 spam-bayes-net-mmartinec 0.016 % 3/19160 spam-bayes-net-wt-en4 score RCVD_IN_BSP_TRUSTED 0 -0.001 0 -0.001 5.312 % 75/1412 spam-bayes-net-bb-guenther_fraud *** 0.030 % 2/6728 spam-bayes-net-wt-en1 0.029 % 7/23761 spam-bayes-net-mmartinec 0.029 % 435/1489699 spam-bayes-net-jm 0.015 % 12/81265 spam-bayes-net-bluestreak 0.003 % 1/36941 spam-bayes-net-hege 0.001 % 11/931863 spam-bayes-net-dos score RCVD_IN_IADB_DK 0 -0.044 0 -0.001 0.059 % 4/6728 spam-bayes-net-wt-en1 0.054 % 1/1840 spam-bayes-net-bb-jhardin 0.033 % 27/81265 spam-bayes-net-bluestreak 0.004 % 1/23761 spam-bayes-net-mmartinec 0.001 % 21/1489699 spam-bayes-net-jm score RCVD_IN_IADB_RDNS 0 -0.018 0 -0.001 0.342 % 23/6728 spam-bayes-net-wt-en1 *** 0.054 % 1/1840 spam-bayes-net-bb-jhardin 0.049 % 1/2055 spam-bayes-net-ahenry 0.033 % 27/81265 spam-bayes-net-bluestreak 0.004 % 1/23761 spam-bayes-net-mmartinec 0.002 % 26/1489699 spam-bayes-net-jm score RCVD_IN_IADB_OPTIN 0 -3.265 0 -2.791 0.342 % 23/6728 spam-bayes-net-wt-en1 *** 0.049 % 1/2055 spam-bayes-net-ahenry 0.000 % 4/1489699 spam-bayes-net-jm score RCVD_IN_IADB_OPTIN_GT50 0 -0.219 0 -1.041 0.054 % 1/1840 spam-bayes-net-bb-jhardin score RCVD_IN_IADB_DOPTIN 0 0.000 % 7/1489699 spam-bayes-net-jm score RCVD_IN_IADB_DOPTIN_LT50 0 -0.001 0 -0.001 0.026 % 21/81265 spam-bayes-net-bluestreak *** 0.001 % 15/1489699 spam-bayes-net-jm.log score RCVD_IN_IADB_DOPTIN_GT50 0 0.007 % 6/81265 spam-bayes-net-bluestreak 0.004 % 1/23761 spam-bayes-net-mmartinec score RCVD_IN_IADB_ML_DOPTIN 0 0.000 % 2/1489699 spam-bayes-net-jm score RCVD_IN_IADB_UT_CPR_MAT 0 -0.001 0 -0.052 0.026 % 21/81265 spam-bayes-net-bluestreak *** 0.001 % 15/1489699 spam-bayes-net-jm score RCVD_IN_IADB_MI_CPR_MAT 0 -0.079 0 -0.001 0.026 % 21/81265 spam-bayes-net-bluestreak *** 0.001 % 15/1489699 spam-bayes-net-jm score RCVD_IN_IADB_LISTED 0 -1.144 0 -0.001 0.342 % 23/6728 spam-bayes-net-wt-en1 *** 0.054 % 1/1840 spam-bayes-net-bb-jhardin 0.049 % 1/2055 spam-bayes-net-ahenry 0.033 % 27/81265 spam-bayes-net-bluestreak 0.004 % 1/23761 spam-bayes-net-mmartinec 0.002 % 26/1489699 spam-bayes-net-jm 0.000 % 1/931863 spam-bayes-net-dos score RCVD_IN_IADB_SENDERID 0 -0.001 0 -0.001 0.208 % 14/6728 spam-bayes-net-wt-en1 *** 0.049 % 1/2055 spam-bayes-net-ahenry 0.033 % 27/81265 spam-bayes-net-bluestreak 0.004 % 1/23761 spam-bayes-net-mmartinec 0.000 % 4/1489699 spam-bayes-net-jm score RCVD_IN_IADB_SPF 0 -0.006 0 -0.042 0.342 % 23/6728 spam-bayes-net-wt-en1 *** 0.054 % 1/1840 spam-bayes-net-bb-jhardin 0.049 % 1/2055 spam-bayes-net-ahenry 0.033 % 27/81265 spam-bayes-net-bluestreak 0.004 % 1/23761 spam-bayes-net-mmartinec 0.002 % 26/1489699 spam-bayes-net-jm score RCVD_IN_IADB_VOUCHED 0 -1.718 0 -0.956 0 -- Configure bugmail: https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.
