Re: high scores on HDRS_LCASE,MANY_HDRS_LCASE > FPs

2012-07-04 Thread Axb

On 07/04/2012 04:40 PM, John Hardin wrote:

On Wed, 4 Jul 2012, Axb wrote:


from last update's 72_scores.cf

score HDRS_LCASE3.749 3.999 3.749 3.999
score MANY_HDRS_LCASE   1.251 1.004 1.251 1.004

Although John manually set low scores in the sandbox file, these are
ignored (per design).


They are _limits_. The generator should not exceed those scores. The
newly limited scores may take a bit to show up in an update.


I'll watch those scores closely.


Fixed/forced scores should be set via 73_sandbox_manual_scores.cf and
not in sanbox files

They have comment:
# observed in UCE 9/2009

As they are hitting lots of ham, can we please loose these.

HDRS_LCASE_IMGONLY may be another candidate to be dropped.


Alex, I don't recall if you're running masschecks; if you are, can you
include such FPs in your ham corpus? The reason they're being scored so
highly by the rescorer is they do perform well against the masscheck
corpus.


I am running masschecks but these hits I see on msgs  (maillog) 
gatewayed thru $dayjob's boxes - not stuff stored locally.


I understand, a lot of rules may perform well in masschecks but overall 
generic patterns should be dropped if we detect that real world traffic 
shows they're dangerous.


Imo, we should be able to trust our traffic & judgement more than 
masscheck corpuses which may be highly biased.


Axb



Re: high scores on HDRS_LCASE,MANY_HDRS_LCASE > FPs

2012-07-04 Thread John Hardin

On Wed, 4 Jul 2012, Axb wrote:


from last update's 72_scores.cf

score HDRS_LCASE3.749 3.999 3.749 3.999
score MANY_HDRS_LCASE   1.251 1.004 1.251 1.004

Although John manually set low scores in the sandbox file, these are ignored 
(per design).


They are _limits_. The generator should not exceed those scores. The newly 
limited scores may take a bit to show up in an update.


Fixed/forced scores should be set via 73_sandbox_manual_scores.cf and not in 
sanbox files


They have comment:
# observed in UCE 9/2009

As they are hitting lots of ham, can we please loose these.

HDRS_LCASE_IMGONLY may be another candidate to be dropped.


Alex, I don't recall if you're running masschecks; if you are, can you 
include such FPs in your ham corpus? The reason they're being scored so 
highly by the rescorer is they do perform well against the 
masscheck corpus.


--
 John Hardin KA7OHZhttp://www.impsec.org/~jhardin/
 jhar...@impsec.orgFALaholic #11174 pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
---
  Ignorance is no excuse for a law.
---
 Today: the 236th anniversary of the Declaration of Independence


Re: [auto] bad sandbox rules report

2012-07-04 Thread Axb

On 07/04/2012 01:44 PM, Benny Pedersen wrote:

Den 2012-07-04 10:44, Axb skrev:


Could we agree to remove/disable "no hits at all" rules?


no, spammers would start using this taktics after disable

is there an minimum time for not hitting ?, or just not hitting last
masscheck ?


This would speed up masschecks quite a bit.


oh fair :)


(also, never hurts to do a cleanup)


it does not take long if done daily, but if it done on each 365 day it
takes 365 days, i know this from expirence :)



Are you a commiter?



Re: [auto] bad sandbox rules report

2012-07-04 Thread Benny Pedersen

Den 2012-07-04 10:44, Axb skrev:


Could we agree to remove/disable "no hits at all" rules?


no, spammers would start using this taktics after disable

is there an minimum time for not hitting ?, or just not hitting last 
masscheck ?



This would speed up masschecks quite a bit.


oh fair :)


(also, never hurts to do a cleanup)


it does not take long if done daily, but if it done on each 365 day it 
takes 365 days, i know this from expirence :)




Re: [auto] bad sandbox rules report

2012-07-04 Thread Axb

Good day Commiters,

Could we agree to remove/disable "no hits at all" rules?

This would speed up masschecks quite a bit.
(also, never hurts to do a cleanup)

Thanks

Axb


high scores on HDRS_LCASE,MANY_HDRS_LCASE > FPs

2012-07-04 Thread Axb

from last update's 72_scores.cf

score HDRS_LCASE3.749 3.999 3.749 3.999
score MANY_HDRS_LCASE   1.251 1.004 1.251 1.004

Although John manually set low scores in the sandbox file, these are 
ignored (per design).


Fixed/forced scores should be set via 73_sandbox_manual_scores.cf and 
not in sanbox files


They have comment:
# observed in UCE 9/2009

As they are hitting lots of ham, can we please loose these.

HDRS_LCASE_IMGONLY may be another candidate to be dropped.


Thanks

Axb.