> I just want to see if it's safe to use two different analyzers for the
> following situation:
>
> I have an index that I want to preserve case with so I can do
> case-sensitive
> searches with my WhitespaceAnalyzer.  However, I also want to do case
> insensitive searches.

you should also make sure the data is indexed twice, once w/ the original
> case and once w/o. It's like putting a TokenFilter after
> WhitespaceTokenizer
> which returns two tokens - lowercased and the original, both in the same
> position (set posIncr to 0).
>

I finally got around to really needing this, and I'm just a little confused
by the implementation.  Should I physically use two different indexes (one
with StandardAnalyzer, one with WhitespaceAnalyzer?), two separate fields (I
don't think that's possible?), or could you explain your idea a little
more?  Should I implement my own WhitespaceTokenizer with the TokenFilter?

Thanks.

Reply via email to