> I just want to see if it's safe to use two different analyzers for the > following situation: > > I have an index that I want to preserve case with so I can do > case-sensitive > searches with my WhitespaceAnalyzer. However, I also want to do case > insensitive searches.
you should also make sure the data is indexed twice, once w/ the original > case and once w/o. It's like putting a TokenFilter after > WhitespaceTokenizer > which returns two tokens - lowercased and the original, both in the same > position (set posIncr to 0). > I finally got around to really needing this, and I'm just a little confused by the implementation. Should I physically use two different indexes (one with StandardAnalyzer, one with WhitespaceAnalyzer?), two separate fields (I don't think that's possible?), or could you explain your idea a little more? Should I implement my own WhitespaceTokenizer with the TokenFilter? Thanks.