Hi, I noticed that words common to all mails seem to get a spam value
close to zero, as in:

0.035      47222     446614 1086615228  Subject

Why isn't it close to 0.500? As far as I can see, the word "Subject" should
have no influence at all on spammishness, since it is always there.

The current effect must be that any mail gets biased towards ham. I could
counteract this by adjusting my BAYES_* scores, but I'd like to have this
list's input on the matter.

The value might be because much less spam than ham has been used for
(automatic) training (apparently 47222 vs. 446614), but why should that
influence the spammishness of other mails?
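
To make my expectation concrete, here is a minimal sketch of the arithmetic
as I understand it. It is not SpamAssassin's actual formula, only two naive
estimates. The function names are mine, and I am assuming that "Subject"
occurs in every trained message, so that the two counts in the dump line
are also the corpus totals.

```python
# Counts copied from the dump line above (spam count, ham count for "Subject").
spam_with_token = 47222
ham_with_token = 446614

# Assumption: "Subject" is in every trained message, so the corpus totals
# are the same as the token counts.
total_spam = spam_with_token
total_ham = ham_with_token


def normalized_probability(s, h, ns, nh):
    """Estimate that divides each count by its corpus size first."""
    spam_freq = s / ns
    ham_freq = h / nh
    return spam_freq / (spam_freq + ham_freq)


def raw_count_ratio(s, h):
    """Estimate that ignores corpus sizes and compares raw counts only."""
    return s / (s + h)


normalized = normalized_probability(
    spam_with_token, ham_with_token, total_spam, total_ham
)
raw = raw_count_ratio(spam_with_token, ham_with_token)

print(f"normalized by corpus size: {normalized:.3f}")
print(f"raw count ratio:           {raw:.3f}")
```

With normalization, a token present in every message comes out at exactly
0.5 no matter how lopsided the training corpus is, which is what I expected.
The raw ratio is pulled towards ham by the corpus imbalance alone. Neither
estimate gives the 0.035 shown in the dump, so something else must be
involved as well.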

- regards, Ole.


