powermail-discuss Digest #2883 - Monday, September 8, 2008
SpamSieve slowness
by "Jeremy Hughes" <[EMAIL PROTECTED]>
Re: SpamSieve slowness
by "Michael Tsai" <[EMAIL PROTECTED]>
----------------------------------------------------------------------
Subject: SpamSieve slowness
From: "Jeremy Hughes" <[EMAIL PROTECTED]>
Date: Mon, 8 Sep 2008 18:25:14 +0100
Hi,
SpamSieve seems to have become really slow for me recently, and I
suspect it is because my spam corpus is pretty large (345,672 messages,
2,390,729 words).
Does this seem unreasonably large?
There used to be a Prune Corpus option, but this has disappeared -
should I just open the corpus and delete everything that was "Last Used"
before 2008?
I first noticed the slowness after moving from Tiger to Leopard a few
weeks back - not sure if this is connected.
Jeremy
----------------------------------------------------------------------
Subject: Re: SpamSieve slowness
From: "Michael Tsai" <[EMAIL PROTECTED]>
Date: Mon, 8 Sep 2008 13:38:27 -0400
On Sep 8, 2008, at 1:25 PM, Jeremy Hughes wrote:
> SpamSieve seems to have become really slow for me recently, and I
> suspect it is because my spam corpus is pretty large (345,672
> messages,
> 2,390,729 words).
>
> Does this seem unreasonably large?
Yes, it's normal to have under 2,000 messages and 200,000 words in the
corpus.
> There used to be a Prune Corpus option, but this has disappeared -
> should I just open the corpus and delete everything that was "Last
> Used"
> before 2008?
That would certainly make it faster. It would probably be better for
the accuracy, however, if you reset the corpus and then re-trained
SpamSieve with a smaller number of recent messages:
<http://c-command.com/spamsieve/manual-ah/using-spamsieve-with-yo>
SpamSieve's auto-training feature has been improved since you started
using it, so the corpus will no longer grow so large by itself.
> I first noticed the slowness after moving from Tiger to Leopard a few
> weeks back - not sure if this is connected.
Yes, it runs faster on Tiger. Apple made some of the APIs much slower
in Leopard.
--
Michael Tsai <http://c-command.com>
----------------------------------------------------------------------
End of powermail-discuss Digest