On Monday 22 August 2016 at 16:45:09, Dianne Skoll wrote:

> On Mon, 22 Aug 2016 07:34:00 -0700 Marc Perkel wrote:
>
> > > So. What percentage of emails using your algorithm are actually
> > > decidable?
> >
> > Almost 100% if you look at a wide variety of tokens from multiple
> > attributes.
>
> I can't believe that, or I'm missing something. Almost every spam I see
> contains words that also appear in ham. Things like "this" or "invoice"
> or "regards" or "dear".
>
> What am I missing?

I believe you're missing Marc's definition of "token".

Antony.

-- 
Anyone that's normal doesn't really achieve much.

 - Mark Blair, Australian rocket engineer

Please reply to the list; please *don't* CC me.