> -----Original Message----- > From: Matt Kettler [mailto:[EMAIL PROTECTED] > Sent: Tuesday, January 31, 2006 4:45 PM > To: jdow > Cc: users@spamassassin.apache.org > Subject: Re: Post your top 10 from sa-stats > > jdow wrote: > > From: <[EMAIL PROTECTED]> > > > > Kristopher Austin wrote: > >> RANK RULE NAME COUNT %OFRULES > %OFMAIL %OFSPAM > >> %OFHAM > >> ------------------------------------------------------------ > >> 1 HTML_MESSAGE 45870 5.13 > 27.72 70.37 > >> 55.36 > > > > Wait... so 27% of all mail is HTML, 70% of spam is HTML, and 55% of > > ham is HTML? > > > > <<jdow>> > > So what's the problem? (He's not running Bayes or it's badly broken, > > though.) > > > If 55% of HAM is HTML, and 70% of spam is HTML, then at LEAST > 55% of mail must be html. > > Unless of course all ham + all spam is less than 100% of mail. > > In which case where's the magic third category that isn't ham > or spam that is less than 27% HTML to drive the total > percentage down to 27%? >
Correct. I explained the logic in a follow-up to Kristopher Austin's post. Dallas