On Wed, Mar 25, 2015 at 06:39:43AM -0300, j...@7lan.net wrote:
> >> 
> >> I tryed to use postgresql driver, but it used a lot of resources.
> >> 
> >> Can you guys give me some suggestions? The database is getting bigger
> >> and I don't know if I'm doing the best maintenance routine.
> >> 
> >> Thanks!
> > 
> > Hi,
> > 
> > I would be leery of using the hash backend for a system with that many
> > users using individual training. You are only using ~1MB/user. What 
> > tokenizer
> > are you using? I would expect you to need much more room per user as 
> > the
> > training progresses, 10-100MB each. I think your disk usage is going to
> > continue to increase to the point that use a PostgreSQL backend would 
> > make
> > sense. How are you planning to address when a hash file becomes 
> > corrupt?
> > 
> > Regards,
> > Ken
> > 
> I'm using osb tokenizer. The database is "new" that's why it is so small 
> today.
> 
> I'm planning to put it in a sql backend. What database does dspam works 
> better? I saw some postgresql schema optimization, but maybe mysql is 
> less resource eater? What are your experiences?
> 
> Thanks!
> 
Hi,

We currently use MySQL with a MyISAM backends with an old release of DSPAM,
version 3.6.x. We are working on an upgrade to the latest release of DSPAM
and change to a PostgreSQL backend to allow us to partition that backend
tables which will allow us to perform maintenance more easily without
impacting concurrent usage: use CLUSTER to keep user tokens adjacent, use
a <100% fill-factor to allow for HOT updates, remove old mail signatures
with TRUNCATE and not DELETE. Note, for as many users as you have, you
may not want to keep the signatures at all and simply retrain the message
if you have it available. The resource usage between MySQL and PostgreSQL
is similar, as far as I know once you move to InnoDB/XtraDB.

Regards,
Ken

------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user

Reply via email to