On Jan 31, 2008 12:42 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
> I have started on SOLR-330 and the first one to tackle is the
> CapitilizationFilterFactory (just starting at the top of the analysis
> package).
>
> At any rate, there are some optimizations to be made here, but one
> thing in the file that is not explicitly stated is that the "keep"
> word list is case-insensitive. This is the current, undocumented,
> behavior. I am fine with documenting and making it so going forward.
> However, if, instead, we make it case-sensitive, we can then use a
> CharArraySet (from Lucene) to do quick look ups of the term buffer
> char array.

CharArraySet can be either case-sensitive or case-insensitive, but you
must specify which when creating it.

-Yonik
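
To illustrate the point about fixing case handling at construction time, here is a minimal, self-contained sketch. It does not use Lucene: KeepWords is a hypothetical stand-in written only for this example, and its names and constructor are assumptions, not the CharArraySet API. Unlike CharArraySet, this stand-in allocates a String on every lookup, so it shows only the behavior and not the performance benefit.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical stand-in for a "keep" word set; NOT the Lucene CharArraySet.
public class KeepWords {
    private final Set<String> words = new HashSet<>();
    private final boolean ignoreCase;

    // Case handling is chosen once, here, and applies to every later lookup.
    public KeepWords(List<String> entries, boolean ignoreCase) {
        this.ignoreCase = ignoreCase;
        for (String entry : entries) {
            words.add(normalize(entry));
        }
    }

    // Lowercase the text only when the set was created as case-insensitive.
    private String normalize(String s) {
        return ignoreCase ? s.toLowerCase(Locale.ROOT) : s;
    }

    // Look up a slice of a term buffer, as a token filter would.
    public boolean contains(char[] buffer, int offset, int length) {
        return words.contains(normalize(new String(buffer, offset, length)));
    }

    public static void main(String[] args) {
        List<String> keep = Arrays.asList("and", "the");
        char[] term = "AND".toCharArray();

        KeepWords insensitive = new KeepWords(keep, true);
        KeepWords sensitive = new KeepWords(keep, false);

        System.out.println(insensitive.contains(term, 0, term.length)); // true
        System.out.println(sensitive.contains(term, 0, term.length));   // false
    }
}
```

With a case-insensitive set, the current undocumented behavior of the "keep" list could be preserved while still doing lookups directly against the term buffer.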