On Wed, Sep 19, 2007 at 05:55:20PM -0400, Dave Koontz wrote:
> Theo and all.  I know this topic comes up on occasion, but I am not sure
> I've ever seen an explanation as to why the bayes_seen file is not auto
> pruned along with the bayes db file.  Since tokens expire in the main DB
> file, what is the purpose of having a seen file to unlearn tokens which
> may have long ago been purged?   IMO, it would seem logical to also
> purge the seen file at some sort of cycle so it can't grow so
> excessively large.

Sure, patches welcome. :)

Seriously, it would require someone to write the code to deal with
expiry, and to upgrade people's seen files (or otherwise handle that
situation), etc.  At a minimum, just adding in a timestamp would help,
but if you wanted to have some mapping of tokens to message, then that's
a whole huge thing.  Oh, and you'd need to support SQL and DBM, of course.

Since you can just rm the seen file or do a "delete from" in SQL, and
have everything continue to work, it hasn't been considered a priority.
But if you think it's important enough to get in, we're happy to accept
the patch to implement it.

-- 
Randomly Selected Tagline:
Forgetfulness, n.:
        A gift of God bestowed upon debtors in compensation for
        their destitution of conscience.

Attachment: pgprYycd4zi6g.pgp
Description: PGP signature

Reply via email to