On Fri, 17 Dec 2004 12:21:18 -0500, "Kenny Pitt" <[EMAIL PROTECTED]>
wrote:

>The "train-on-mistakes-and-unsures" strategy implemented in the Outlook
>addin is believed to be the most effective strategy for most general users.

Is that how the automated training is implemented in the latest CVS
versions? Or are you talking about manual training, starting with an empty
database and correcting any mistakes as new messages arrive?

I was thinking that the "train on mistakes" approach could be taken a step
further, down to the individual token level: all encountered tokens are
stored in the database, but only "activated" for filtering when found to be
required to filter correctly; that is, when a mistake is found, tokens are
activated in order of decreasing significance until classification is
correct. Has anyone tried anything like this?
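
To make the idea concrete, here is a minimal Python sketch of token-level "train on mistakes". It is an illustration under stated assumptions, not SpamBayes code: the class LazyTokenFilter and its methods are hypothetical names, the per-token probability uses simple add-one smoothing, and scores are combined by plain averaging rather than the chi-squared combining SpamBayes actually uses. "Significance" is assumed to mean distance of a token's spam probability from 0.5.

```python
from collections import Counter


class LazyTokenFilter:
    """Hypothetical sketch: every token is counted, but only 'active' tokens vote."""

    def __init__(self):
        self.spam_counts = Counter()  # messages per token, seen in spam
        self.ham_counts = Counter()   # messages per token, seen in ham
        self.active = set()           # tokens currently used for filtering

    def spam_prob(self, token):
        # Add-one smoothed spam probability (an assumption, not SpamBayes' formula).
        s = self.spam_counts[token]
        h = self.ham_counts[token]
        return (s + 1) / (s + h + 2)

    def score(self, tokens):
        # Average over active tokens only; 0.5 means "no evidence either way".
        probs = [self.spam_prob(t) for t in set(tokens) if t in self.active]
        return sum(probs) / len(probs) if probs else 0.5

    def is_spam(self, tokens):
        return self.score(tokens) > 0.5

    def learn(self, tokens, spam):
        # Always record the counts, whether or not the message was misclassified.
        unique = set(tokens)
        (self.spam_counts if spam else self.ham_counts).update(unique)
        if self.is_spam(tokens) == spam:
            return  # classified correctly: activate nothing
        # Mistake: activate inactive tokens, most significant first
        # (ties broken alphabetically so the result is deterministic).
        ranked = sorted(unique - self.active,
                        key=lambda t: (-abs(self.spam_prob(t) - 0.5), t))
        for token in ranked:
            self.active.add(token)
            if self.is_spam(tokens) == spam:
                break  # stop as soon as the message is classified correctly


f = LazyTokenFilter()
f.learn(["hello", "meeting"], spam=False)         # correct by default, nothing activated
f.learn(["viagra", "cheap", "hello"], spam=True)  # mistake, so tokens get activated
print(sorted(f.active))
```

In this toy version a message with no active tokens scores 0.5 and is treated as ham, so the first ham message activates nothing, while the first spam message activates only as many tokens as are needed to push its score above 0.5.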

-- Mat.


_______________________________________________
[EMAIL PROTECTED]
http://mail.python.org/mailman/listinfo/spambayes
Check the FAQ before asking: http://spambayes.sf.net/faq.html
