Re: CapitilizationFilterFactory

Yonik Seeley Thu, 31 Jan 2008 09:46:00 -0800

On Jan 31, 2008 12:42 PM, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
> I have started on SOLR-330 and the first one to tackle is the
> CapitilizationFilterFactory (just starting at the top of the analysis
> package).
>
> At any rate, there are some optimizations to be made here, but one
> thing in the file that is not explicitly stated is that the "keep"
> word list is case-insensitive.  This is the current, undocumented,
> behavior.  I am fine with documenting and making it so going forward.
> However, if, instead, we make it case-sensitive, we can then use a
> CharArraySet (from Lucene) to do quick lookups of the term buffer
> char array.


CharArraySet can be either case-sensitive or case-insensitive, but you
must specify which when creating it.
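
To illustrate the point, here is a minimal JDK-only sketch of a keep-word
set whose case handling is fixed at construction time. It is a stand-in,
not Lucene's CharArraySet: the KeepWords class, its ignoreCase flag, and the
sample words are all hypothetical names chosen for this example. As far as I
recall, the real class takes a similar boolean in its constructor, e.g.
new CharArraySet(startSize, ignoreCase), but check the Javadoc for the
Lucene version in use.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

/** Hypothetical JDK-only stand-in for a keep-word set; NOT Lucene's CharArraySet. */
final class KeepWords {
    private final Set<String> words = new HashSet<>();
    // Chosen once at construction and never changed afterwards.
    private final boolean ignoreCase;

    KeepWords(Collection<String> initial, boolean ignoreCase) {
        this.ignoreCase = ignoreCase;
        for (String w : initial) {
            words.add(normalize(w));
        }
    }

    // Lower-case entries and queries only when the set is case-insensitive.
    private String normalize(String s) {
        return ignoreCase ? s.toLowerCase(Locale.ROOT) : s;
    }

    /** Looks up a slice of a term buffer, mirroring a char[]-based lookup. */
    boolean contains(char[] buffer, int offset, int length) {
        // This sketch allocates a String per lookup for simplicity.
        return words.contains(normalize(new String(buffer, offset, length)));
    }

    public static void main(String[] args) {
        char[] term = "mcdonald".toCharArray();
        KeepWords insensitive = new KeepWords(Arrays.asList("McDonald", "and"), true);
        KeepWords sensitive = new KeepWords(Arrays.asList("McDonald", "and"), false);
        System.out.println(insensitive.contains(term, 0, term.length)); // true
        System.out.println(sensitive.contains(term, 0, term.length));   // false
    }
}
```

The same word list gives different answers depending only on the flag passed
at creation, so the existing case-insensitive "keep" behavior would not by
itself rule out a char[]-based set. Unlike this sketch, a set that hashes the
term buffer in place avoids creating a String for each lookup, which is the
optimization being discussed.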

-Yonik
