On Tue 24 August 2004, 12:30:49 +1000, Drago wrote:
> ok im kinda confused here, I've checked these 3 macros out
> 

<...snip...>


> if my total spam is 270 and total clean is 246, shouldn't my spam percentage
> be 51% - 52% since my overall spam percentage is greater than clean or is the
> 270 my complete mails, 246 clean and the 24 left over spam
> 
> i cant see how i get 2% and below spam when my spam is greater than clean
> mails
> 
> anyone want to help me out here?


All your calculations involve fractions such as:
   %STATNONSPAMLETTERS(168)/%STATSPAMLETTERS(168)
   %STATSPAMERLETTERS(168)/%STATSPAMLETTERS(168)

You are quoting these as if they were percentages, but to covert them to
a percentage you should multiply them by 100

So, for instance, %STATNONSPAMLETTERS(168)/%STATSPAMLETTERS(168) is
246/270 which is 0.91111, which is 91.111%

Now I am guessing the meaning of the macros is:
   %STATSPAMLETTERS()    - number of msgs classified as spam in time period
   %STATNONSPAMLETTERS() - number of msgs classified as nonspam in time period
   %STATSPAMERLETTERS()  - number of msgs classified as spam but
                           reclassified as nonspam in time period

Lets abbreviate (for simplicity) as follows:
   %STATSPAMLETTERS()    - %s
   %STATNONSPAMLETTERS() - %n
   %STATSPAMERLETTERS()  - %e

Then the correct calculations are:
   Percentage BayesIT guessed wrong = 100*%e/(%s+%n)
   Percentage BayesIT guessed right = 100*(1-%e/(%s+%n))
   Percentage of email that is spam = 100*(%s/(%s+%n))


-- 
Robin

Using The Bat! v2.12.00 on Windows XP 5.1 Build 2600 Service Pack 1






________________________________________________
Current version is 2.12.00 | 'Using TBUDL' information:
http://www.silverstones.com/thebat/TBUDLInfo.html

Reply via email to