-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Robert Menschel writes:
> Tuesday, August 24, 2004, 7:02:00 PM, you wrote:
> JM> That's the main issue that we had in the past with "external"
> JM> rulesets -- most of those were developed without measuring accuracy,
> JM> and once tested they don't come out too hot. But from what I can see
> JM> (from outside) it looks like you all have been doing that for a
> JM> while, which is cool.
>
> Yes. We post the rules to each other, run them through two or more
> (usually three) corpora, and use the combined results to determine
> whether rules are viable. (We're hoping to add a fourth corpus soon.)
>
> Viable to us is less strict than viable to the development team, lower
> thresholds, but the basic philosophies are the same, I think.

Yep, from what I can see, agreed ;)

> JM> (BTW I should qualify what Daniel means by "non-heavyweight" -- in
> JM> other words, the rule doesn't greatly affect speed/RAM usage. I
> JM> think that's what he means at least.)
>
> Also important to us. My system, for instance, does a comprehensive
> mass-check on a single rule to dozens of rules in about half an hour. If
> any rule causes a noticeable jump in this performance measure, we either
> fix it or toss it.
>
> (I can't really measure RAM usage on my system, but the same concern
> applies.)
>
> We've also been trying to some extent to document a rule's history, so we
> know whether it came from a CLA member or elsewhere. We're discussing
> ways of making that more formal.

The version 2 Apache license has some text allowing "trivial"
contributions to not require CLAs -- but then, what's the definition of
"trivial"? We haven't got a really good definition of that as it applies
to rules yet, unfortunately ;)

> JM> If we can work something out, that'll be great ;)
>
> We're all agreed about that. I'm hopeful we can.

Cool.

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFBLVedQTcbUG5Y7woRAoR2AKDibOVadoq72mOsUiSRc1eMVM2SEwCfU312
iiAoHwRTNkPlN0RX/yODUU0=
=VPad
-----END PGP SIGNATURE-----
