Commit access to svn

Warren Togami Sun, 27 Sep 2009 22:45:21 -0700

Hi folks,

What do I need to do to gain commit access? I sent in the signed ApacheCLA a few weeks ago but I heard nothing back.

My plans initially are only to put new tests into the sandbox to see howthey do.

* Get Adam Katz's KHOP rules updated in the sandbox so they can beproperly tested.


* Sandbox testing of additional blacklists like JMF, SEM

* Split PSBL into sub rules. RCVD_IN_PSBL is currently looking at allheaders instead of just last-external. This can work very well. But Ibelieve there is a simple way to improve this furter by splitting itinto two subrules. This change can be made after the GA rescoring ifthe rule is split properly.

Use RCVD_IN_PSBL_2WEEKS to assign a score. RCVD_IN_PSBL_DEEP would bete equivalent to RCVD_IN_PSBL_2WEEKS. The stricter RCVD_IN_PSBL wouldbe a subrule that matches only with last-external, thereby beingstricter and eliminating most of the already mininuscule chance of falsepositives. Thus the full score of RCVD_IN_PSBL_2WEEKS would be splitinto two parts.


Before
RCVD_IN_PSBL_2WEEKS score 2
This rule does deep parsing which is often good, but sometimes bad.

After
RCVD_IN_PSBL score 2
This rule matces only last-external making it safer from FP's.
RCVD_IN_PSBL_DEEP score -1

This rule is can be scored separately, subtracting a tiny amount if thePSBL hit was found in deep parsing. Both rules would trigger, one adds,the second subtracts. The subtracting rule would never fire on its own.

* I am also looking at ways to expand the use of the SOUGHT methodology.Either improve the existing SOUGHT, or launch a separate SOUGHT-likechannel based upon an entirely different corpus. For example, Japanesespam trap corpus + Japanese ham corpus = SOUGHT-JP nightly sa-updatechannel. I'm even seeing big spam differences between jm's corpusgenerated sought rules and my own corpus. There is room for improvementwith the current SOUGHT.


Warren Togami
[email protected]

Commit access to svn

Reply via email to