I have some questions about the how the Bayesian filter chooses the
numbers for spam file names. The numbers it uses in a given day are all
over the place. That is fine, and I really don't care.

What I care about is that sometimes on the same day a spam email
arrives, it overwrites another received that same day. Or the file no
longer exists between say midnight and like 8AM. When I look around I
can see spam emails that arrived days ago, some months ago.

So while I can't say it seems to follow numeric order. It doesn't seem
to follow date order either. Short of reviewing code, I am looking for
some info on how the Bayesian filter chooses file name numbers. Also
which emails/numbers it would remove/recycle.

On a related side note. Should one ever replace, start over, etc with
the bayesian database? Not saying I will clean out the emails stored in
folders. But I have a 3+ year old Bayesian database. Just curious if
there are any draw backs to the Bayesian db over time.

Thanks

-- 
William L. Thomson Jr.
Gentoo/Java

Attachment: signature.asc
Description: This is a digitally signed message part

-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
Assp-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-user

Reply via email to