Bob> I don't recall that anyone has explained how to trim the database
    Bob> with the Outlook plugin.

Yeah, I sort of glossed over that bit.  I'm not sure there's really a
convenient way to do it either (not being an Outlook user I can't explore
the idea in that context).  If you retain the spam that lands in your junk
box I imagine you could sort by date and delete the oldest N%, then retrain
from scratch.  I'm not sure how well that would work well for your non-junk
box.  Maybe have "old" versions for each of your non-junk boxes (for
instance, have an "old-python" mailbox corresponding to your "python"
mailbox) and move the oldest N% to the old version, then only use the
current versions for retraining.

In any case, this is all very un-automatic.  It might be nice of the Outlook
plugin could do the grunt work for you.

In my procmail environment, I have two training mailboxes, ham and spam.
When I want to trim, I simply sort them by date, trim the oldest N%, then
retrain.  My training regimen (using the train-to-exhaustion script, tte.py)
also has an option to delete messages that train correctly on each pass,
thus limiting growth of those special mailboxes.

Skip

_______________________________________________
[email protected]
http://mail.python.org/mailman/listinfo/spambayes
Check the FAQ before asking: http://spambayes.sf.net/faq.html

Reply via email to