Hi,

I looking to know what calculations are made by graham burton to calculate
the probability and confidence.

I look at the source code, and look at the equation of the combined
probability but it's too hard for me :(

I think the probability per token is calculated like this: If the word
"viagra" (example) appears in 400 of 3 000 spam messages in 5 of the 300
legitimate messages, for example, then its spam probability would be 0,8889
(that is, [400/3000] divided by [5 / 300 +400 / 3000])

But for the confidence I do not know?

And what is exactly Pvalue?



Thank you in advance once again,


coma
------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
Dspam-user mailing list
Dspam-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspam-user

Reply via email to