I have an idea for a spambayes variation that should be better suited
to multi-user systems.  The goal is to make the DB partly conditional
on the recipient address.  In addition to storing <token>, spambayes
could also store (<recipient>, <token>).  When scoring a message, the
probability for (<recipient>, <token>) would be added to the evidence
alongside the probability for <token>.
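
To make the idea concrete, here is a minimal sketch of such a store.
It is not spambayes code: the class PairedTokenDB, its methods, and
the plain ratio estimate are all hypothetical, and it assumes the
global message counts are used as the denominator for the
per-recipient keys as well (per-recipient counts might be a better
choice).

```python
from collections import defaultdict


class PairedTokenDB:
    """Hypothetical store keeping counts for <token> and (<recipient>, <token>)."""

    def __init__(self):
        # key -> [spam_count, ham_count]; a key is a token or a (recipient, token) pair
        self.counts = defaultdict(lambda: [0, 0])
        self.nspam = 0
        self.nham = 0

    def keys_for(self, recipient, tokens):
        # Each distinct token yields two keys: the bare token and the paired one.
        for tok in set(tokens):
            yield tok
            yield (recipient, tok)

    def learn(self, recipient, tokens, is_spam):
        if is_spam:
            self.nspam += 1
        else:
            self.nham += 1
        index = 0 if is_spam else 1
        for key in self.keys_for(recipient, tokens):
            self.counts[key][index] += 1

    def evidence(self, recipient, tokens):
        """Return (key, spam probability) for every key seen in training."""
        clues = []
        for key in self.keys_for(recipient, tokens):
            if key not in self.counts:
                continue
            spam, ham = self.counts[key]
            spam_ratio = spam / self.nspam if self.nspam else 0.0
            ham_ratio = ham / self.nham if self.nham else 0.0
            if spam_ratio + ham_ratio == 0.0:
                continue
            # Simple ratio estimate, without any smoothing toward a prior.
            clues.append((key, spam_ratio / (spam_ratio + ham_ratio)))
        return clues


db = PairedTokenDB()
db.learn("alice@example.com", ["viagra", "meeting"], is_spam=True)
db.learn("bob@example.com", ["meeting"], is_spam=False)
alice_clues = dict(db.evidence("alice@example.com", ["meeting"]))
bob_clues = dict(db.evidence("bob@example.com", ["meeting"]))
```

In this toy run "meeting" is neutral across all users, while the
paired keys lean in opposite directions for the two recipients.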

I'm looking at chi2_spamprob() and wondering whether feeding it both
kinds of evidence is valid, statistics-wise.  Is there a better way
to include the (<recipient>, <token>) evidence?
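
For reference, a rough sketch of chi-squared (Fisher) combining of
per-clue probabilities follows.  It is a simplified illustration and
not the actual chi2_spamprob() code: the function names chi2q and
combine and the clamping constant are assumptions made for this
example.  Fisher's method assumes the combined probabilities are
independent, and <token> and (<recipient>, <token>) clearly are not,
so the sketch may help show what happens when the same clue is
effectively counted twice.

```python
import math


def chi2q(x2, v):
    """Probability that a chi-squared variable with even v degrees of
    freedom is at least x2."""
    assert v % 2 == 0
    m = x2 / 2.0
    total = term = math.exp(-m)
    for i in range(1, v // 2):
        term *= m / i
        total += term
    return min(total, 1.0)


def combine(probs):
    """Combine per-clue spam probabilities into one score in [0, 1]."""
    n = len(probs)
    if n == 0:
        return 0.5
    # Clamp away from 0 and 1 so the logarithms stay finite (assumed constant).
    eps = 1e-6
    probs = [min(max(p, eps), 1.0 - eps) for p in probs]
    ln_h = sum(math.log(p) for p in probs)
    ln_s = sum(math.log(1.0 - p) for p in probs)
    s = 1.0 - chi2q(-2.0 * ln_s, 2 * n)  # spam indicator
    h = 1.0 - chi2q(-2.0 * ln_h, 2 * n)  # ham indicator
    return (s - h + 1.0) / 2.0


single = combine([0.9])        # one clue for <token>
doubled = combine([0.9, 0.9])  # same clue again via (<recipient>, <token>)
```

Here doubled comes out more extreme than single, which is the
double-counting effect the validity question is about.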

BTW, if this idea actually works, using (<sender>, <token>) may also
be helpful.

  Neil
_______________________________________________
spambayes-dev mailing list
spambayes-dev@python.org
http://mail.python.org/mailman/listinfo/spambayes-dev
