https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6667

Mark Martinec <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Version|unspecified                 |3.3.2
   Target Milestone|Undefined                   |3.4.0

--- Comment #1 from Mark Martinec <[email protected]> 2011-10-03 16:42:09 UTC ---
>> but increases database size by about a factor of 8 to 10."
> 
> Unless someone can come up with a good reason why the claim about database
> size is true, I would suggest it be removed.


Also, on the ML, RW said:
> I've never understood what this is supposed to mean, and I suspect
> it's just plain wrong. bayes_use_hapaxes determines whether hapaxes
> (tokens with a total count of 1) are used in the calculation. It
> doesn't affect whether they are stored; and it can't since all tokens
> start off as hapaxes. It might have a marginal effect through the
> updating of atimes, but in that case it's expediting the removal
> of the most useful hapaxes.


Having examined the code, I agree that this is indeed the case: I don't
see how the bayes_use_hapaxes setting would have any impact on the size
of a database.
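
To illustrate the argument, here is a minimal sketch in Python. It is a
toy model, not SpamAssassin's actual code: the names TokenCounts, learn
and tokens_for_scoring are hypothetical, and atime handling and expiry
are left out. It only shows that learning stores every token regardless
of the setting, while the setting filters which stored tokens take part
in scoring.

```python
from dataclasses import dataclass


@dataclass
class TokenCounts:
    """Per-token counters (hypothetical structure, for illustration only)."""
    spam: int = 0
    ham: int = 0


def learn(db, tokens, is_spam):
    """Store every token; a token seen for the first time starts as a hapax."""
    for tok in tokens:
        counts = db.setdefault(tok, TokenCounts())
        if is_spam:
            counts.spam += 1
        else:
            counts.ham += 1


def tokens_for_scoring(db, tokens, use_hapaxes):
    """Pick the stored tokens that take part in the probability calculation."""
    selected = []
    for tok in tokens:
        counts = db.get(tok)
        if counts is None:
            continue  # never learned, nothing to score with
        total = counts.spam + counts.ham
        if total == 1 and not use_hapaxes:
            continue  # hapax skipped at scoring time, but still stored
        selected.append(tok)
    return selected


# Learning does not consult the setting at all.
db = {}
learn(db, ["viagra", "meeting"], is_spam=True)
learn(db, ["meeting", "agenda"], is_spam=False)

message = ["viagra", "meeting", "agenda"]
with_hapaxes = tokens_for_scoring(db, message, use_hapaxes=True)
without_hapaxes = tokens_for_scoring(db, message, use_hapaxes=False)
```

In this model the database holds three tokens in both cases; only the
list of tokens used for scoring differs.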

Unless someone can show otherwise, I'll strike the statement about
increasing the database size from the docs:

trunk:
  [doc] Bug 6667: bayes_use_hapaxes and dubious claim about database size
  Sending lib/Mail/SpamAssassin/Conf.pm
Committed revision 1178448.
