On Sat, 5 Apr 2014, John Hardin wrote:

On Sat, 5 Apr 2014, Axb wrote:

 On 04/05/2014 07:33 PM, John Hardin wrote:

>   The masscheck spam corpus isn't pathetically small, but at the moment
>   it's *strongly* biased towards the traffic *you* are seeing. Your spam
>   is 490k+ of the 510k total corpus.

 Should I feel guilty for only masschecking the last 21 days?

No, certainly not. But I did want to point out that the corpus is biased at the moment.

Let me amend that: I don't have any idea how diverse your corpora feeds are, so it's entirely possible that your providing the bulk of masscheck spam recently isn't actually causing any bias in the results.

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  Where are my space habitats? Where is my flying car?
  It's 2010 and all I got from the SF books of my youth
  is the lousy dystopian government.                      -- perlhaqr
-----------------------------------------------------------------------
 8 days until Thomas Jefferson's 271st Birthday
