On Wed, Sep 19, 2007 at 05:55:20PM -0400, Dave Koontz wrote: > Theo and all. I know this topic comes up on occasion, but I am not sure > I've ever seen an explanation as to why the bayes_seen file is not auto > pruned along with the bayes db file. Since tokens expire in the main DB > file, what is the purpose of having a seen file to unlearn tokens which > may have long ago been purged? IMO, it would seem logical to also > purge the seen file at some sort of cycle so it can't grow so > excessively large.
Sure, patches welcome. :) Seriously, it would require someone to write the code to deal with expiry, and to upgrade people's seen files (or otherwise handle that situation), etc. At a minimum, just adding in a timestamp would help, but if you wanted to have some mapping of tokens to message, then that's a whole huge thing. Oh, and you'd need to support SQL and DBM, of course. Since you can just rm the seen file or do a "delete from" in SQL, and have everything continue to work, it hasn't been considered a priority. But if you think it's important enough to get in, we're happy to accept the patch to implement it. -- Randomly Selected Tagline: Forgetfulness, n.: A gift of God bestowed upon debtors in compensation for their destitution of conscience.
pgprYycd4zi6g.pgp
Description: PGP signature