On Sat, 5 Apr 2014, John Hardin wrote:
On Sat, 5 Apr 2014, Axb wrote:
On 04/05/2014 07:33 PM, John Hardin wrote:
> The masscheck spam corpus isn't pathetically small, but at the moment
> it's *strongly* biased towards the traffic *you* are seeing. Your spam
> is 490k+ of the 510k total corpus.
Should I feel guilty for only masschecking the last 21 days?
No, certainly not. But I did want to point out that the corpus is biased at
the moment.
Let me amend that: I don't have any idea how diverse your corpora feeds
are, so it's entirely possible that your providing the bulk of masscheck
spam recently isn't actually causing any bias in the results.
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhar...@impsec.org FALaholic #11174 pgpk -a jhar...@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
Where are my space habitats? Where is my flying car?
It's 2010 and all I got from the SF books of my youth
is the lousy dystopian government. -- perlhaqr
-----------------------------------------------------------------------
8 days until Thomas Jefferson's 271st Birthday