On Wed, Mar 25, 2015 at 06:39:43AM -0300, j...@7lan.net wrote: > >> > >> I tryed to use postgresql driver, but it used a lot of resources. > >> > >> Can you guys give me some suggestions? The database is getting bigger > >> and I don't know if I'm doing the best maintenance routine. > >> > >> Thanks! > > > > Hi, > > > > I would be leery of using the hash backend for a system with that many > > users using individual training. You are only using ~1MB/user. What > > tokenizer > > are you using? I would expect you to need much more room per user as > > the > > training progresses, 10-100MB each. I think your disk usage is going to > > continue to increase to the point that use a PostgreSQL backend would > > make > > sense. How are you planning to address when a hash file becomes > > corrupt? > > > > Regards, > > Ken > > > I'm using osb tokenizer. The database is "new" that's why it is so small > today. > > I'm planning to put it in a sql backend. What database does dspam works > better? I saw some postgresql schema optimization, but maybe mysql is > less resource eater? What are your experiences? > > Thanks! > Hi,
We currently use MySQL with a MyISAM backends with an old release of DSPAM, version 3.6.x. We are working on an upgrade to the latest release of DSPAM and change to a PostgreSQL backend to allow us to partition that backend tables which will allow us to perform maintenance more easily without impacting concurrent usage: use CLUSTER to keep user tokens adjacent, use a <100% fill-factor to allow for HOT updates, remove old mail signatures with TRUNCATE and not DELETE. Note, for as many users as you have, you may not want to keep the signatures at all and simply retrain the message if you have it available. The resource usage between MySQL and PostgreSQL is similar, as far as I know once you move to InnoDB/XtraDB. Regards, Ken ------------------------------------------------------------------------------ Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/ _______________________________________________ Dspam-user mailing list Dspam-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspam-user