On Wed, 20 Jan 2016 16:26:19 -0500 Matt Garretson wrote: > I am not an expert but it does seem like the main novel thing is how > (and how many) multi-word tokens are generated. I use have been using > multi-word tokens with bogofilter for years and it does help. Of > course bogofilter only uses adjacent words -- perhaps OP's way of > combining words could yield an increase in accuracy, at the expense > of processing time.
It's exactly the same as bogofilter.
