mar...@mejor.pl wrote:
Hi!
Thanks to AXB seek-in-phrases-in-log works OK. Now I'm on the next step with 
automated creating rules.
I suspect that mk_meta_rule_scores doesn't assign scores correctly. I set in 
mk_meta_rule_scores:
my %scoremap = (
  '70' => '1.5',
  '4' => '2.0',
  '0.01' => '3.0',
);

If I understand correctly (quite possible I don't; I haven't dug in to the internals of this stage), the %scoremap hash above indicates the percentage of the messages that need to hit on a subrule for that subrule to be included in one of the meta rules.

So, 70% of the mail in the set would need to hit on a subrule for it to be included in the first group, 4% in the second, and 0.01% in the third.

If I read the information flow correctly, this is actually decided by seek-phrases-in-log, which spits out subrules that reached a certain hit rate in blocks, followed by the "# passed hit-rate threshold nnn" line. mk_meta_rule_scores just takes that in, collects the rule names in each block, and spits out the meta.

(In fact the stock setup refers to mk_meta_rule, although it's nearly identical to mk_meta_rule_scores.)

By raising the hit percentage to 70%, you're requiring that 70% of the spam you're using must hit on one of the subrules. TBH, by that point, you may as well hand-extract a couple of the subrules and make them static, standalone rules.

My experience has been that you need lots of mail to generate multiple metas in any case; I've taken a different tack and separated out different groups of mail for different generated rule sets instead.

-kgd

Reply via email to