-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

jdow writes:
>I have sympathies for your problem. As a user I can see allowing you to
>use something like a line in procmail to clone any email that is tagged
>as a virus with a high enough score, say 10 or 15, to be used to maintain
>a corpus. I'd have sincere problems with the idea of your pulling
>messages to build a ham corpus, though. So you'd end up with a really
>huge spam corpus and a puny and likely biased ham corpus.

Worse -- the spam corpus would be entirely biased towards a particular
sub-set of spam (the easily identifiable stuff).  That's no good...

>And then you'd still be faced with the task of processing all these
>diverse spam captures to eliminate duplicates.

Well, duplicates aren't a huge problem really.  Getting rid of those
is a best-case scenario.

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFAnB5fQTcbUG5Y7woRAkhrAJ9d3hxJT1MidgIyBnUHfsYVrMWZcwCgzNTf
fr8vq4q+Kg6E5ty/9SPKen8=
=tmML
-----END PGP SIGNATURE-----
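
On the duplicate-elimination point above, here is a rough sketch of how
exact duplicates could be dropped from a pile of captured messages. It is
only an illustration, not anything from the thread: the function names
(body_fingerprint, drop_exact_duplicates) are made up for the example, and
it assumes each capture is available as a raw RFC 822 string. It hashes the
whitespace-normalised, lower-cased body, so it only catches copies that are
identical apart from headers, case and spacing; spam with randomised filler
would need fuzzier matching.

```python
import hashlib
import re
from email import message_from_string


def body_fingerprint(raw_message: str) -> str:
    """Hash the whitespace-normalised, lower-cased text body of one message.

    Hypothetical helper: headers are ignored, so the same body sent from
    different addresses yields the same fingerprint.
    """
    msg = message_from_string(raw_message)
    if msg.is_multipart():
        # Keep only the text/* leaves of a multipart message.
        parts = [p for p in msg.walk() if p.get_content_maintype() == "text"]
    else:
        parts = [msg]
    text = "\n".join(str(p.get_payload()) for p in parts)
    normalised = re.sub(r"\s+", " ", text).strip().lower()
    return hashlib.sha256(normalised.encode("utf-8", "replace")).hexdigest()


def drop_exact_duplicates(raw_messages):
    """Return the messages in order, keeping the first copy of each body."""
    seen = set()
    unique = []
    for raw in raw_messages:
        fingerprint = body_fingerprint(raw)
        if fingerprint not in seen:
            seen.add(fingerprint)
            unique.append(raw)
    return unique
```

Keeping the first copy and preserving order is an arbitrary choice for the
sketch; a real corpus tool might instead keep the copy with the most
complete headers.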
