On Thu, 2011-10-27 at 17:28 -0700, Peter Williams wrote:
> Hi all,
> 
> Around a month ago, I set up a new server and installed dspam on it.
> Things have been running smoothly until today, when suddenly the
> performance of dspam has slowed to a crawl. Even simple messages take a
> few seconds to process (when they used to go by instantaneously), and a
> single HTML or base64-encoded message can send my load average up to 5.
> 
> Any thoughts as to how this might have happened? I'm quite sure that the
> character of my email traffic hasn't suddenly changed, and the rest of
> the system seems to be functioning smoothly. Can dspam learn enormous
> tokens, and if it did so, would that slow down subsequent processing?

Investigating deeper, I think I've crossed some threshold with the hash
driver. I just now tried the (undocumented?) "heavy" option to cssclean,
clearing out some extra tokens from my CSS files, and the performance
has gotten back to where it was before. (I've been running cssclean
daily.)

(One thing I should have mentioned was that the performance issues are
definitely I/O-related, not CPU-related.)

I noticed that some of my CSS files have just begun filling up their
second extent, so maybe there's some nonlinearity in the performance
there. The "heavy" cssclean removed the second extent from one user's
file that I checked, but another user's mail is back to good performance
even though the CSS file still has two extents.

The general tone from web postings is that the hash driver needs a lot
of attention, but most of those postings seem to be a few years old. Is
this sentiment still accurate?

Peter

> 
> System config info below.
> 
> Thanks for any ideas,
> 
> Peter
> 
> ====================
> OS: CentOS 6
> DSpam: 3.10.1 (EPEL build, release 1.el6)
> Context: virtualized server, ~750 MB memory allocated
> 
> Incoming mail is accepted by exim, passed to a dspam server on LMTP,
> and delivered by Dovecot communicating over LMTP as well.
> 
> Storage driver: libhash_drv
> Other config:
> 
> TrainingMode toe
> TestConditionalTraining on
> Feature whitelist
> Algorithm graham burton
> Tokenizer osb
> PValue bcr
> HashRecMax 6291469
> HashAutoExtend on
> HashMaxExtens 0
> HashExtentSize 49157
> HashPctIncrease 10
> HashMaxSeek 10
> HashConnectionCache 10
> Broken case
> 

-- 
Peter Williams / pe...@newton.cx


------------------------------------------------------------------------------
The demand for IT networking professionals continues to grow, and the
demand for specialized networking skills is growing even more rapidly.
Take a complimentary Learning@Cisco Self-Assessment and learn 
about Cisco certifications, training, and career opportunities. 
http://p.sf.net/sfu/cisco-dev2dev
_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user

Reply via email to