Hi there,
Just a quick question that the wiki page
(http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters) didn't seem to
answer very well.
Given an analyzer that has zero or more Char Filter Factories, one
Tokenizer Factory, and zero or more Token Filter Factories, which value(s)
are
to the entire input stream and PatternReplaceFilterFactory
is applied to each token emitted by the tokenizer.
And to make it even more fun, you can do both!
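To make the difference concrete, here is a rough sketch in plain Python. It is not Solr code: the function names (char_filter, tokenize, token_filter, analyze) and the whitespace tokenizer are made up for illustration. It only shows the ordering: char-level replacement sees the whole input before tokenizing, while token-level replacement sees one token at a time.

```python
import re

def char_filter(text, pattern, replacement):
    # Char-filter stage: rewrites the whole input stream before tokenizing.
    return re.sub(pattern, replacement, text)

def tokenize(text):
    # Hypothetical stand-in for a whitespace tokenizer.
    return text.split()

def token_filter(tokens, pattern, replacement):
    # Token-filter stage: rewrites each token independently.
    return [re.sub(pattern, replacement, token) for token in tokens]

def analyze(text, char_rules=(), token_rules=()):
    # Order of stages: char filters, then one tokenizer, then token filters.
    for pattern, replacement in char_rules:
        text = char_filter(text, pattern, replacement)
    tokens = tokenize(text)
    for pattern, replacement in token_rules:
        tokens = token_filter(tokens, pattern, replacement)
    return tokens

# A pattern that spans whitespace can only match before tokenization.
as_char_rule = analyze("wi fi router", char_rules=[(r"wi fi", "wifi")])
as_token_rule = analyze("wi fi router", token_rules=[(r"wi fi", "wifi")])

# Doing both: merge at the char level, then rewrite the merged token.
both = analyze(
    "wi fi router",
    char_rules=[(r"wi fi", "wifi")],
    token_rules=[(r"^wifi$", "wireless")],
)
print(as_char_rule, as_token_rule, both)
```

The same pattern gives different results depending on the stage, because the token-level rule never sees "wi fi" as a single string.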
Best
Erick
On Wed, Apr 13, 2011 at 8:14 AM, Ben Davies ben.dav...@gmail.com wrote:
> Hi there,
> Just a quick question that the wiki page
I can't remember where I read it, but I think MappingCharFilterFactory is
preferred.
There is an example in the example schema.
<charFilter class="solr.MappingCharFilterFactory"
mapping="mapping-ISOLatin1Accent.txt"/>
From this, I get:
org.apache.solr.analysis.MappingCharFilterFactory
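For anyone wondering what a mapping char filter does to the input, here is a rough sketch in plain Python, not the Solr implementation. The four table entries and the function names are invented for illustration, and the sketch only handles single-character keys. The point is that the mapping is applied to the raw character stream before the tokenizer runs.

```python
# Illustrative entries only; these are not copied from mapping-ISOLatin1Accent.txt.
ACCENT_MAP = {
    "\u00c0": "A",  # A with grave accent
    "\u00e9": "e",  # e with acute accent
    "\u00fc": "u",  # u with diaeresis
    "\u00f1": "n",  # n with tilde
}

def mapping_char_filter(text, mapping=ACCENT_MAP):
    # Replace each mapped character in the raw input; unmapped characters pass through.
    return text.translate(str.maketrans(mapping))

def analyze(text):
    # The char filter runs first; a hypothetical lowercase + whitespace tokenizer follows.
    return mapping_char_filter(text).lower().split()

tokens = analyze("Caf\u00e9 M\u00fcnchen")
print(tokens)
```

Because the accents are stripped before tokenizing, every later stage only ever sees the unaccented text.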