On Wed, Jan 13, 2010 at 1:50 PM, Ted Dunning <[email protected]> wrote: > > Aside from that, you might try using the white space analyzer (much faster > than the standard). This loses stemming and stop words, but that might be > OK for you. >
StandardAnalyzer doesn't do stemming, but if you want a little faster than that one, SimpleAnalyzer is just letter-char tokenizing + lowercasing (you want that one!) is pretty damn fast. -jake
