On Fri, 17 Dec 2004 12:21:18 -0500, "Kenny Pitt" <[EMAIL PROTECTED]> wrote:
>The "train-on-mistakes-and-unsures" strategy implemented in the Outlook
>addin is believed to be the most effective strategy for most general users.

Is that how the automated training is implemented in the latest CVS versions? Or are you talking about manual training, starting with an empty database and correcting any mistakes as new messages arrive?

I was thinking that the "train on mistakes" approach could be taken a step further, down to the individual token level: all encountered tokens are stored in the database, but only "activated" for filtering when found to be required to filter correctly; that is, when a mistake is found, tokens are activated in order of decreasing significance until classification is correct.

Has anyone tried anything like this?

-- 
Mat.
_______________________________________________
[EMAIL PROTECTED]
http://mail.python.org/mailman/listinfo/spambayes
Check the FAQ before asking: http://spambayes.sf.net/faq.html
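P.S. To make the token-level idea concrete, here is a rough Python sketch of what I have in mind. It is only an illustration, not SpamBayes code: the class TokenActivationFilter and its methods are hypothetical names, and the score is a plain sum of smoothed log-odds, which I am assuming for simplicity rather than the combining scheme SpamBayes actually uses. "Significance" is taken to mean the absolute log-odds of a token.

```python
import math
from collections import defaultdict


class TokenActivationFilter:
    """Toy filter (hypothetical): all tokens are counted, only activated ones score."""

    def __init__(self):
        self.spam_counts = defaultdict(int)  # token -> number of spam messages containing it
        self.ham_counts = defaultdict(int)   # token -> number of ham messages containing it
        self.n_spam = 0
        self.n_ham = 0
        self.active = set()                  # tokens currently allowed to affect the score

    def log_odds(self, token):
        # Laplace-smoothed log-odds of spam versus ham for a single token.
        p_spam = (self.spam_counts[token] + 1) / (self.n_spam + 2)
        p_ham = (self.ham_counts[token] + 1) / (self.n_ham + 2)
        return math.log(p_spam / p_ham)

    def score(self, tokens):
        # Assumed simplification: sum the evidence from activated tokens only.
        return sum(self.log_odds(t) for t in set(tokens) if t in self.active)

    def is_spam(self, tokens):
        # A positive score means spam; a message with no active tokens counts as ham.
        return self.score(tokens) > 0

    def learn(self, tokens, spam):
        """Record a labelled message; return how many tokens were newly activated."""
        unique = set(tokens)
        was_wrong = self.is_spam(tokens) != spam

        # Every encountered token is stored, whether or not it is active.
        counts = self.spam_counts if spam else self.ham_counts
        for t in unique:
            counts[t] += 1
        if spam:
            self.n_spam += 1
        else:
            self.n_ham += 1

        if not was_wrong:
            return 0

        # On a mistake, activate this message's inactive tokens in order of
        # decreasing significance until the message is classified correctly.
        candidates = sorted(
            (t for t in unique if t not in self.active),
            key=lambda t: (-abs(self.log_odds(t)), t),
        )
        activated = 0
        for t in candidates:
            if self.is_spam(tokens) == spam:
                break
            self.active.add(t)
            activated += 1
        return activated
```

In this sketch a token that never helps fix a mistake stays dormant, so the working set of active tokens should grow only as fast as the filter makes errors. Whether that actually classifies better than scoring on every stored token is exactly what I am asking about.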
