On Fri, 2010-03-12 at 18:50 -0500, Rob McEwen wrote:
> Your stats are certainly valuable and illustrative... but not reflective
> of the stats one would see in MOST "real world" mail streams where:
>
> (A) the spams were sent to actual users (which would be a distinctively
> different mix of spams compared to a pure honeypot stream of spams [...]
Just for comparison, below are some stats gathered quickly from two
different and entirely unrelated systems. Real mail streams, real users
only, no traps.
RANK  RULE NAME      COUNT  %OFMAIL  %OFSPAM  %OFHAM
----------------------------------------------------------------------
  11  URIBL_BLACK     1202    62.73    63.94    0.00
   8  URIBL_BLACK      572    41.12    78.36    0.00
Unfortunately, both of these systems are still on 3.2.5, so there is no
util_rb_3tld love either, which would drive up the %OFSPAM numbers quite a
bit.

Oh, and there are some custom rules in place, which take the highest
ranks in these lists, pushing URIBL and all other stock rules down. So
don't read anything into the rank...
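
For anyone who wants to sanity-check the percentages in the table, here is a
rough sketch of how the per-system totals can be backed out of a single row.
It assumes %OFMAIL is hits divided by all scanned mail and %OFSPAM is hits
divided by all spam, and that a %OFHAM of 0.00 means every hit was on spam.
The function name implied_totals is made up for illustration and the results
are only estimates, since the percentages are rounded to two decimals.

```python
def implied_totals(hits, pct_of_mail, pct_of_spam):
    """Estimate (total_mail, total_spam, total_ham) from one stats row.

    Assumes %OFMAIL = hits / total_mail * 100 and, because %OFHAM is 0.00,
    that every hit was on spam, so %OFSPAM = hits / total_spam * 100.
    """
    total_mail = round(hits / (pct_of_mail / 100))
    total_spam = round(hits / (pct_of_spam / 100))
    return total_mail, total_spam, total_mail - total_spam


# (COUNT, %OFMAIL, %OFSPAM) for the two URIBL_BLACK rows in the table.
rows = [(1202, 62.73, 63.94), (572, 41.12, 78.36)]
for hits, pct_mail, pct_spam in rows:
    mail, spam, ham = implied_totals(hits, pct_mail, pct_spam)
    print(f"hits={hits} mail~{mail} spam~{spam} ham~{ham}")
```

Under those assumptions the two streams look quite different: the first is
almost entirely spam, while the second is close to an even spam/ham split,
which is part of why the %OFMAIL and %OFSPAM columns diverge so much there.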
--
char *t="\10pse\0r\0dtu...@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4";
main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;i<l;i++){ i%8? c<<=1:
(c=*++x); c&128 && (s+=h); if (!(h>>=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}