On Tue 24 August 2004, 12:30:49 +1000, Drago wrote:
> ok im kinda confused here, I've checked these 3 macros out
>
<...snip...>
> if my total spam is 270 and total clean is 246, shouldn't my spam percentage
> be 51% - 52% since my overall spam percentage is greater than clean or is the
> 270 my complete mails, 246 clean and the 24 left over spam
>
> i cant see how i get 2% and below spam when my spam is greater than clean
> mails
>
> anyone want to help me out here?
All your calculations involve fractions such as:
%STATNONSPAMLETTERS(168)/%STATSPAMLETTERS(168)
%STATSPAMERLETTERS(168)/%STATSPAMLETTERS(168)
You are quoting these as if they were percentages, but to covert them to
a percentage you should multiply them by 100
So, for instance, %STATNONSPAMLETTERS(168)/%STATSPAMLETTERS(168) is
246/270 which is 0.91111, which is 91.111%
Now I am guessing the meaning of the macros is:
%STATSPAMLETTERS() - number of msgs classified as spam in time period
%STATNONSPAMLETTERS() - number of msgs classified as nonspam in time period
%STATSPAMERLETTERS() - number of msgs classified as spam but
reclassified as nonspam in time period
Lets abbreviate (for simplicity) as follows:
%STATSPAMLETTERS() - %s
%STATNONSPAMLETTERS() - %n
%STATSPAMERLETTERS() - %e
Then the correct calculations are:
Percentage BayesIT guessed wrong = 100*%e/(%s+%n)
Percentage BayesIT guessed right = 100*(1-%e/(%s+%n))
Percentage of email that is spam = 100*(%s/(%s+%n))
--
Robin
Using The Bat! v2.12.00 on Windows XP 5.1 Build 2600 Service Pack 1
________________________________________________
Current version is 2.12.00 | 'Using TBUDL' information:
http://www.silverstones.com/thebat/TBUDLInfo.html