1. A few months ago, several good messages were being tagged with the STATISTIC flag. Many three-letter words (such as 'and' 'the' 'nor' 'but') had a high 'spam' number (eg 0,57).

2. My example shows new words with 0.45 when these are fairly common: 'bluecross' 'blueshield'.

What do you mean by 'for this specific user'? This server relays for two domains: primary for one and secondary for other. The server simply tags the messages and the second server (IMail 7.07, mailboxes) sorts.

Please offer a suggestion for improvement of method.

adamc


Scott Perry wrote:

You shouldn't.

The whole idea behind Bayesian filtering is that you can predict the chances of an E-mail being spam based on data about prior spam. If you change the data to something that it wasn't, you will change the outcome of the Bayesian filtering.

Changing the numbers is no different than taking a mean profit margin of 12.2% and rounding it up to 13% for your boss. It may seem nicer, it may make your boss happier, it may take up less space on a report. But it won't be mathematically accurate. It won't be mathematically accurate even with correct data (Bayesian filtering is really just a guesstimate, based loosely on Bayes' Theorem), but there is mathematical logic that gets broken by playing with the numbers.

If you haven't trained the Bayesian filtering for the spam and legitimate E-mail for this specific user, you should do so. Note that Bayesian filtering works much, much better when it is trained for each specific user (otherwise, for example, a mortgage broker will lose a lot of legitimate E-mail, due to the preponderance of mortgage related spam that other users get).
-Scott



To Unsubscribe: http://www.ipswitch.com/support/mailing-lists.html List Archive: http://www.mail-archive.com/imail_forum%40list.ipswitch.com/ Knowledge Base/FAQ: http://www.ipswitch.com/support/IMail/


To Unsubscribe: http://www.ipswitch.com/support/mailing-lists.html List Archive: http://www.mail-archive.com/imail_forum%40list.ipswitch.com/ Knowledge Base/FAQ: http://www.ipswitch.com/support/IMail/

Reply via email to