-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

John Peacock writes:
> Justin Mason wrote:
>
> > as a matter of interest, how much disk space does this database
> > of as-yet-untrained tokens take up? It's something we've considered
> > implementing in SpamAssassin, but the disk space issue is an
> > important datum before considering it.
>
> This is not for the faint of heart. dspam is a major app which makes
> very heavy use of the backend database (with MySQL being the most common
> and fastest backend). On a freshly installed server using a dump of my
> production database, my entire dspam database is 1.7GB, of which the
> token table is 266M/336M (data/index); I don't have a breakdown of how
> much of that is untrained vs trained. The signature table is 1.1G/9.0M
> for comparison.
>
> For comparison, my production server is 7.2GB, on the other hand
> reflecting the high water mark (since MySQL never shrinks the tables
> unless optimized manually).

wow -- that's massive! ;) How many users does that account for?

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.5 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFCEkoSMJF5cimLx9ARAg9/AJ41DwjX0/Ukw+jMRECJVlniJ0yysACeIhXC
1qtVic96VD/vS0WF/5qu1RU=
=PNpn
-----END PGP SIGNATURE-----
