https://issues.apache.org/SpamAssassin/show_bug.cgi?id=5736
--- Comment #13 from Karsten Bräckelmann <[EMAIL PROTECTED]> 2008-10-30 10:13:47 PST ---

The ruleqa results are heavily biased anyway. The only ham hits are in Michael's corpus, which is quite "small" compared to Daryl's and Justin's ham corpora. Extrapolating the number of hams to align the corpora paints an even worse picture and makes the S/O ratio drop significantly -- below the already *poor* 0.5 it shows today (which is without Theo's massive corpus, granted). Most of the English-centric ham corpora are much less likely to contain German company domains.

I kind of wonder if From headers are a good indicator today anyway. Most of my spam shows a forged sender. The increasing problem of backscatter supports this.

+1 for seriously down-scoring FROM_DOMAIN_NOVOWEL, if we keep it at all.

Let's just hope GMX uses sa-update. Ironically, a German company. If they don't, I'm afraid it'll take quite a few GMX users complaining to gently massage the message from front-line support down to the tech staff. (GMX themselves evaded this rule, FWIW, using gmx-gmbh.de with a hyphen. Doh!)
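
To make the extrapolation argument concrete, here is a rough sketch with made-up numbers. It assumes S/O is simply the fraction of a rule's hits that land on spam, and that ham hits scale linearly with corpus size; the real ruleqa computation may weight things differently. The function names and all figures are hypothetical, not taken from the ruleqa output.

```python
def so_ratio(spam_hits: float, ham_hits: float) -> float:
    """Fraction of a rule's hits that are spam (1.0 = only spam, 0.0 = only ham)."""
    total = spam_hits + ham_hits
    if total == 0:
        raise ValueError("rule has no hits at all")
    return spam_hits / total


def extrapolated_ham_hits(ham_hits: float, corpus_size: int, target_size: int) -> float:
    """Scale ham hits as if the small corpus had target_size messages at the same hit rate."""
    return ham_hits * target_size / corpus_size


# Hypothetical figures, for illustration only.
spam_hits = 20
ham_hits = 20            # all ham hits come from one small corpus
small_corpus = 5_000     # assumed size of that corpus
large_corpus = 50_000    # assumed size of a larger ham corpus

observed = so_ratio(spam_hits, ham_hits)
scaled_ham = extrapolated_ham_hits(ham_hits, small_corpus, large_corpus)
aligned = so_ratio(spam_hits, scaled_ham)

print(f"observed S/O: {observed:.3f}")
print(f"S/O after aligning corpus sizes: {aligned:.3f}")
```

With these invented numbers the observed S/O of 0.5 falls to roughly 0.09 once the small corpus is scaled up tenfold, which is the direction of the effect described above.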
