Re: more habeas spam

Karsten Bräckelmann Thu, 08 Jan 2009 07:38:11 -0800

On Tue, 2009-01-06 at 19:31 +0100, Kai Schaetzl wrote:
> It appears to me that the HABEAS rules are hitting only a very tiny fraction 
> of 
> mail, many of the nightly mass-checks don't have a hit at all (or is it that 
> those 
> checks don't contain any network checks?). The aggregated view shows no hits 
> at all 
> for these rules.


Network tests are done once a week, not daily.

> I'm not sure if I'm reading the ruleqa correctly, although I read it's help.
> 1. I'm wondering why many rules show a score of 0.0

These appear to be network tests.

> 2. do I understand it correctly that a nightly check contains only the spam
> received over the last 24 hours?

No. The nightly mass-checks contain the full corpora.

> 3. I don't see any explanation for s/o and rank. (Rank seems to be some sort 
> of 
> ranking according to the hit rate, but I find it hardly understandable that a 
> rule 
> that hits a lot of messages, like URIBL_SURBL, scores 1.0 as rank and a rule 
> that 
> hits almost no messages still scores at half of that. s/o seems to show the 
> ham/spam ratio cleanliness?)

Correct, S/O is the Spam / Overall ratio. The higher that ratio, the
better the rule and the lower the ham hits (in percent, not absolute
numbers).


-- 
char *t="\10pse\0r\0dtu...@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4";
main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;i<l;i++){ i%8? c<<=1:
(c=*++x); c&128 && (s+=h); if (!(h>>=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}

Re: more habeas spam

Reply via email to