Re: mk_meta_rule_scores - does it work correctly?:)

Kris Deugau Tue, 14 Mar 2017 08:24:06 -0700

mar...@mejor.pl wrote:

Hi!
Thanks to AXB seek-in-phrases-in-log works OK. Now I'm on the next step with 
automated creating rules.
I suspect that mk_meta_rule_scores doesn't assign scores correctly. I set in 
mk_meta_rule_scores:
my %scoremap = (
  '70' => '1.5',
  '4' => '2.0',
  '0.01' => '3.0',
);

If I understand correctly (quite possible I don't; I haven't dug in tothe internals of this stage), the %scoremap hash above indicates thepercentage of the messages that need to hit on a subrule for thatsubrule to be included in one of the meta rules.

So, 70% of the mail in the set would need to hit on a subrule for it tobe included in the first group, 4% in the second, and 0.01% in the third.

If I read the information flow correctly, this is actually decided byseek-phrases-in-log, which spits out subrules that reached a certain hitrate in blocks, followed by the "# passed hit-rate threshold nnn" line.mk_meta_rule_scores just takes that in, collects the rule names in eachblock, and spits out the meta.

(In fact the stock setup refers to mk_meta_rule, although it's nearlyidentical to mk_meta_rule_scores.)

By raising the hit percentage to 70%, you're requiring that 70% of thespam you're using must hit on one of the subrules. TBH, by that point,you may as well hand-extract a couple of the subrules and make themstatic, standalone rules.

My experience has been that you need lots of mail to generate multiplemetas in any case; I've taken a different tack and separated outdifferent groups of mail for different generated rule sets instead.


-kgd

Re: mk_meta_rule_scores - does it work correctly?:)

Reply via email to