Hi, I noticed that words, common to all mails, seem to get at spamvalue of close to zero, as in:
0.035 47222 446614 1086615228 Subject Why is'nt it close to 0.500? As far as I can see, the word "Subject" should have exactly no influence on spammishnes since it is always there. The current effect must be that any mail gets biased towards ham. I could counteract this by adjusting my BAYES_* scores, but I'd like to have this list's input on the matter. The value might be because much less spam than ham has been used for (automatic) training (apparently 47222 vs. 446614), but why should that influence the spammishness of other mails? - regards, Ole.
