John Morse wrote:

> I have uploaded my entire "Bayesit" directory, so that anyone can
> download it and have the pleasure of using one of the finest spam
> filters anywhere. You will not have to train it with so many emails,

<snip>

> Don't worry about training Bayesit with a bunch of messages, just go
> through the motions, because this will replace all the training
> anyway.

Nice gesture, but a bad idea IMO.  The beauty of Bayesian filtering is
that it learns from an individual's email.  Other users may well get
spam of a different nature than yours and, aside from TB! mailing
lists, will almost certainly receive a different assortment of good
mail.

Here's an excerpt from an interesting article on Bayesian filtering:

,----- [ http://www.paulgraham.com/spam.html ]
| Ideally, of course, the probabilities should be calculated
| individually for each user. I get a lot of email containing the word
| "Lisp", and (so far) no spam that does. So a word like that is
| effectively a kind of password for sending mail to me. In my earlier
| spam-filtering software, the user could set up a list of such words
| and mail containing them would automatically get past the filters. On
| my list I put words like "Lisp" and also my zipcode, so that
| (otherwise rather spammy-sounding) receipts from online orders would
| get through. I thought I was being very clever, but I found that the
| Bayesian filter did the same thing for me, and moreover discovered of
| a lot of words I hadn't thought of.
`-----

I haven't used BayesIt, but the training period for POPFile was
remarkably short.  Well worth the effort to have it know *my* mail.

-- 
George

Using TB! 2.02.3 CE on Windows XP Pro 5.1, Build 2600, Service Pack 1.


________________________________________________
Current version is 2.02.3 CE | "Using TBUDL" information:
http://www.silverstones.com/thebat/TBUDLInfo.html

Reply via email to