-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Daniel Quinlan writes:
> Chris Thielen <[EMAIL PROTECTED]> writes:
> 
> > Second, where is the appropriate place for discussion of FPs?
> > Bugzilla, sa-dev or elsewhere?
> 
> If it's a question about your mail, I'd say sa-dev.  If it's a rule you
> think could be improved, I'd say bugzilla.  If it's someone else's
> results (a potential FP that you want to ask about), I usually use
> private email, maybe bugzilla.

I would suggest sa-dev unless it involves some detail of the mail that
might not be appropriate to discuss publicly.  sa-dev in general is good
so that, should the mass-check discussion turn into a general discussion
of something else (as they often do!), it's archived and public for people
to comment.

> > Third, I'm wondering what the thought is on age of ham corpora.  I'm
> > getting several FPs on (for instance) MPART_ALT_DIFF, some of which are
> > from older ham (a few spammy-looking legitimate mailings from
> > nextcard.com in 2000).  Do I purge these messages from my corpus
> > assuming they're from a broken ancient mailer or should they be tallied
> > as usual?  Do I simply narrow my ham corpus to 6 months or younger like
> > my spam corpus?  A quick chat with DQ on freenode indicated he uses both
> > his full corpus and a smaller/newer subset depending on the occasion.
> 
> I think 2000 is too old for ham, although I think Theo uses some really
> old mail in his corpus.  For ham, a year is okay, maybe two years at the
> outside.  You do want to include some older MTAs because the entire
> world does not upgrade at once, but I don't believe the nextcard.com
> mailer really falls into that category.

I'd definitely not scan mail from before Jan 2002, at the oldest.
(Don't we have some Super-Official Corpus Guidelines for this? ;)

> I generally don't purge much of anything, but I generally do a
> --tail=<big number> in my mass-check and move older stuff to another
> directory from time to time.
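For the "move older stuff to another directory" step, here is a minimal
sketch in Python.  It assumes one message per file and trusts the Date:
header; the function names, the directory layout and the cutoff are all
hypothetical, not anything mass-check itself provides.

```python
import email
import email.utils
import shutil
from datetime import datetime, timezone
from pathlib import Path


def message_date(path):
    """Return the Date: header as an aware datetime, or None if unusable."""
    with open(path, "rb") as fh:
        msg = email.message_from_binary_file(fh)
    raw = msg.get("Date")
    if raw is None:
        return None
    try:
        parsed = email.utils.parsedate_to_datetime(str(raw))
    except (TypeError, ValueError):
        # Unparseable date: leave the decision to a human.
        return None
    if parsed.tzinfo is None:
        # Assumption: treat dates without a usable zone as UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed


def archive_old_ham(corpus_dir, archive_dir, cutoff):
    """Move messages dated before `cutoff` from corpus_dir to archive_dir.

    Messages with a missing or unparseable Date: header stay where they are.
    Returns the list of file names that were moved.
    """
    corpus, archive = Path(corpus_dir), Path(archive_dir)
    archive.mkdir(parents=True, exist_ok=True)
    moved = []
    for path in sorted(corpus.iterdir()):
        if not path.is_file():
            continue
        when = message_date(path)
        if when is not None and when < cutoff:
            shutil.move(str(path), str(archive / path.name))
            moved.append(path.name)
    return moved
```

Something like archive_old_ham("ham", "ham-old", datetime(2002, 1, 1,
tzinfo=timezone.utc)) would then match the Jan 2002 cutoff suggested
above.  Nothing is deleted, so the older mail is still around if you want
to run against the full corpus on occasion.
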

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFAqdA5QTcbUG5Y7woRAlonAKCsGgkbfa2XwUsJMFdM4TvBFtFkKACfREGu
Si6ZF8uzAun2EsIWIEPDzt4=
=9TnE
-----END PGP SIGNATURE-----
