On Monday 22 August 2016 at 16:45:09, Dianne Skoll wrote:

> On Mon, 22 Aug 2016 07:34:00 -0700 Marc Perkel wrote:
> > > So.  What percentage of emails using your algorithm are actually
> > > decidable?
> > 
> > Almost 100% if you look at a wide variety of tokens from multiple
> > attributes.
> 
> I can't believe that, or I'm missing something.  Almost every spam I see
> contains words that also appear in ham.  Things like "this" or "invoice"
> or "regards" or "dear".
> 
> What am I missing?

I believe you're missing Marc's definition of "token".


Antony.

-- 
Anyone that's normal doesn't really achieve much.

 - Mark Blair, Australian rocket engineer

                                                   Please reply to the list;
                                                         please *don't* CC me.

Reply via email to