I have an idea for a spambayes variation that should be better suited to multi-user systems. The goal is to condition the DB partly on the recipient address. In addition to storing counts for <token>, spambayes could also store them for the pair (<recipient>, <token>). When scoring a message, the probability for (<recipient>, <token>) would be added to the evidence alongside the probability for <token>.
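To make the proposal concrete, here is a rough sketch of what it might look like. This is a toy model, not the real spambayes classifier: the class RecipientAwareDB, its method names, and the constants s=0.45 and x=0.5 are all assumptions made for illustration. Per-token probabilities use a Robinson-style adjustment, and the clues are combined with a chi-squared (Fisher) calculation of the kind chi2_spamprob() performs. The sketch also normalizes the (<recipient>, <token>) counts by the global message totals, which is itself a questionable shortcut; per-recipient totals might be more appropriate.

```python
import math
from collections import defaultdict


def chi2Q(x2, v):
    """Survival function of chi-squared with an even number v of degrees of freedom."""
    assert v % 2 == 0
    m = x2 / 2.0
    total = term = math.exp(-m)
    for i in range(1, v // 2):
        term *= m / i
        total += term
    return min(total, 1.0)


class RecipientAwareDB:
    """Hypothetical token store keyed by token and by (recipient, token)."""

    def __init__(self):
        self.nspam = 0
        self.nham = 0
        # key -> [spam count, ham count]; key is a str or a (recipient, str) tuple
        self.counts = defaultdict(lambda: [0, 0])

    def keys_for(self, recipient, tokens):
        # Each distinct token contributes two clues: plain and recipient-qualified.
        for tok in set(tokens):
            yield tok
            yield (recipient, tok)

    def learn(self, recipient, tokens, is_spam):
        if is_spam:
            self.nspam += 1
        else:
            self.nham += 1
        for key in self.keys_for(recipient, tokens):
            self.counts[key][0 if is_spam else 1] += 1

    def prob(self, key, s=0.45, x=0.5):
        # s and x are assumed values for the unknown-word strength and probability.
        spam, ham = self.counts.get(key, (0, 0))
        n = spam + ham
        if n == 0:
            return x
        spamratio = spam / max(self.nspam, 1)
        hamratio = ham / max(self.nham, 1)
        p = spamratio / (spamratio + hamratio)
        # Shrink toward x when there is little evidence for this key.
        return (s * x + n * p) / (s + n)

    def score(self, recipient, tokens):
        probs = [self.prob(k) for k in self.keys_for(recipient, tokens)]
        if not probs:
            return 0.5
        n = len(probs)
        S = 1.0 - chi2Q(-2.0 * sum(math.log(1.0 - p) for p in probs), 2 * n)
        H = 1.0 - chi2Q(-2.0 * sum(math.log(p) for p in probs), 2 * n)
        return (S - H + 1.0) / 2.0


db = RecipientAwareDB()
for _ in range(5):
    db.learn("alice@example.com", ["viagra", "offer"], is_spam=True)
    db.learn("bob@example.com", ["viagra", "study"], is_spam=False)

alice_score = db.score("alice@example.com", ["viagra"])
bob_score = db.score("bob@example.com", ["viagra"])
print(alice_score, bob_score)
```

In this toy run the plain token is neutral overall, so any difference between the two scores comes from the recipient-qualified clue. Note that the two clues for a token are strongly correlated, and the chi-squared combination treats every clue as independent, which is exactly where the statistical doubt lies.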
I'm looking at chi2_spamprob() and wondering if this is valid, statistics-wise. Is there some better way to include the (<recipient>, <token>) evidence? BTW, if this idea actually works, using (<sender>, <token>) may also be helpful.

Neil