On Wed, 30 Mar 2011 16:51:57 +0200
Marcin Mirosław <[email protected]> wrote:
> I'm using postgresql, but machine isn't quick... Any db is slowly
> there.
Using Pg for Bayes data will be really slow. We don't use the SpamAssassin
Bayes implementation and we went through three iterations of storage
back-ends before finding one we liked.
1) PostgreSQL: Convenient but slow.
2) Berkeley DB: Faster than PostgreSQL, but still slow and
occasionally flaky
3) CDB: Very fast, but cannot be incrementally updated. You need to rebuild
the entire DB and then atomically rename it.
In our implementation, it's not a problem to have a read-only DB, so we went
with CDB. It's dramatically faster than Berkeley DB:
http://www.dmo.ca/blog/benchmarking-hash-databases-on-large-data/
Regards,
David.