Field Analyzers: which values are indexed?

2011-04-13 Thread Ben Davies
Hi there, Just a quick question that the wiki page ( http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters) didn't seem to answer very well. Given an analyzer that has zero or more Char Filter Factories, one Tokenizer Factory, and zero or more Token Filter Factories, which value(s) are

Re: Field Analyzers: which values are indexed?

2011-04-13 Thread Ben Davies
to the entire input stream and PatternReplaceFilterFactory is applied to each token emitted by the tokenizer. And to make it even more fun, you can do both! Best Erick On Wed, Apr 13, 2011 at 8:14 AM, Ben Davies ben.dav...@gmail.com wrote: Hi there, Just a quick question that the wiki page

Re: question on solr.ASCIIFoldingFilterFactory

2011-04-05 Thread Ben Davies
I can't remember where I read it, but I think MappingCharFilterFactory is prefered. There is an example in the example schema. charFilter class=solr.MappingCharFilterFactory mapping=mapping-ISOLatin1Accent.txt/ From this, I get: org.apache.solr.analysis.MappingCharFilterFactory