-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Daniel Quinlan writes:
> Chris Thielen <[EMAIL PROTECTED]> writes:
>
> > Second, where is the appropriate place for discussion of FPs?
> > Bugzilla, sa-dev or elsewhere?
>
> If it's a question about your mail, I'd say sa-dev. If it's a rule you
> think could be improved, I'd say bugzilla. If it's someone else's
> results (a potential FP that you want to ask about), I usually use
> private email, maybe bugzilla.

I would suggest sa-dev unless it involves some detail of the mail that
might not be appropriate to discuss publicly. sa-dev in general is good
so that, should the mass-check discussion turn into a general discussion
of something else (as they often do!), it's archived and public for
people to comment.

> > Third, I'm wondering what the thought is on age of ham corpora. I'm
> > getting several FPs on (for instance) MPART_ALT_DIFF, some of which are
> > from older ham (a few spammy-looking legitimate mailings from
> > nextcard.com in 2000). Do I purge these messages from my corpus
> > assuming they're from a broken ancient mailer or should they be tallied
> > as usual? Do I simply narrow my ham corpus to 6 months or younger like
> > my spam corpus? A quick chat with DQ on freenode indicated he uses both
> > his full corpus and a smaller/newer subset depending on the occasion.
>
> I think 2000 is too old for ham, although I think Theo uses some really
> old mail in his corpus. For ham, a year is okay, maybe two years at the
> outside. You do want to include some older MTAs because the entire
> world does not upgrade at once, but I don't believe the nextcard.com
> mailer really falls into that category.

I'd definitely not scan mail from before Jan 2002, at the oldest.
(Don't we have some Super-Official Corpus Guidelines for this? ;)

> I generally don't purge much of anything, but I generally do a
> --tail=<big number> in my mass-check and move older stuff to another
> directory from time to time.

- --j.
-----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.4 (GNU/Linux) Comment: Exmh CVS iD8DBQFAqdA5QTcbUG5Y7woRAlonAKCsGgkbfa2XwUsJMFdM4TvBFtFkKACfREGu Si6ZF8uzAun2EsIWIEPDzt4= =9TnE -----END PGP SIGNATURE-----
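For anyone who wants to apply the "nothing before Jan 2002" cutoff to a ham corpus by hand, here is a minimal sketch in Python. It is not part of mass-check; the names `CUTOFF`, `message_date` and `keep_for_mass_check` are made up for illustration, and it assumes the Date header of each message is trustworthy enough to sort on, which will not hold for every corpus.

```python
from datetime import datetime, timezone
from email import message_from_string
from email.utils import parsedate_to_datetime

# Assumed cutoff, taken from the "not before Jan 2002" suggestion in the thread.
CUTOFF = datetime(2002, 1, 1, tzinfo=timezone.utc)


def message_date(raw):
    """Return the Date header as an aware UTC datetime, or None if unusable."""
    header = message_from_string(raw).get("Date")
    if header is None:
        return None
    try:
        parsed = parsedate_to_datetime(header)
    except (TypeError, ValueError):
        # Unparseable dates raise TypeError or ValueError depending on the
        # Python version; treat both as "no usable date".
        return None
    if parsed.tzinfo is None:
        # A "-0000" zone yields a naive datetime; treat it as UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


def keep_for_mass_check(raw, cutoff=CUTOFF):
    """True if the message has a parseable Date at or after the cutoff."""
    date = message_date(raw)
    return date is not None and date >= cutoff
```

Messages that fail the check could be moved to a separate directory rather than deleted, in line with the "move older stuff to another directory" habit described above. Messages with a missing or unparseable Date are rejected here, which is a design choice you may want to reverse for your own corpus.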
