https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6667
Mark Martinec <[email protected]> changed: What |Removed |Added ---------------------------------------------------------------------------- Version|unspecified |3.3.2 Target Milestone|Undefined |3.4.0 --- Comment #1 from Mark Martinec <[email protected]> 2011-10-03 16:42:09 UTC --- >> but increases database size by about a factor of 8 to 10." > > Unless someone can come up with a good reason why the claim about database > size is true, I would suggest it be removed. Also, on the ML, RW said: > I've never understood what this is supposed to mean, and I suspect it > it's just plain wrong. bayes_use_hapaxes determines whether hapaxes > (tokens with a total count of 1) are used in the calculation. It > doesn't affect whether they are stored; and it can't since all tokens > start-off as hapaxes. It might have a marginal effect through the > updating of atimes, but in that case it's expediting the removal > of the most useful hapaxes. Examining the code I agree this is indeed the case, I don't see how bayes_use_hapaxes setting would have any impact on the size of a database. Unless someone can show otherwise, I'll strike the statement about increasing the database size from the docs: trunk: [doc] Bug 6667: bayes_use_hapaxes and dubious claim about database size Sending lib/Mail/SpamAssassin/Conf.pm Committed revision 1178448. -- Configure bugmail: https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug.
