-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

John Peacock writes:
> Justin Mason wrote:
> 
> > as a matter of interest, how much disk space does this database
> > of as-yet-untrained tokens take up?   It's something we've considered
> > implementing in SpamAssassin, but the disk space issue is an
> > important datum before considering it.
> 
> This is not for the faint of heart.  dspam is a major app which make 
> very heavy use of the backend database (with MySQL being the most common 
> and fastest backend).  On a freshly installed server using a dump of my 
> production database, my entire dspam database is 1.7GB, of which the 
> token table is 266M/336M (data/index); I don't have a breakdown of how 
> much of that is untrained vs trained.  The signature table is 1.1G/9.0M 
> for comparison.
> 
> For comparison, my production server is 7.2GB, on the other hand 
> reflecting the high water mark (since MySQL never shrinks the tables 
> unless optimized manually).

wow -- that's massive! ;)   How many users does that account for?

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.5 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFCEkoSMJF5cimLx9ARAg9/AJ41DwjX0/Ukw+jMRECJVlniJ0yysACeIhXC
1qtVic96VD/vS0WF/5qu1RU=
=PNpn
-----END PGP SIGNATURE-----

Reply via email to