(11/12/13 6:51), Devon Baumgarten wrote:
Hello,
I am having trouble finding how to remove/ignore whitespace when indexing. The
only answer I have found suggested that it is necessary to write my own
tokenizer. Is this true? I want to remove whitespace and special characters
from the phrase and create N-grams from the result.
How about using one of existing charfilters?
https://builds.apache.org/job/Solr-3.x/javadoc/org/apache/solr/analysis/PatternReplaceCharFilterFactory.html
https://builds.apache.org/job/Solr-3.x/javadoc/org/apache/solr/analysis/MappingCharFilterFactory.html
koji
--
Check out "Query Log Visualizer" for Apache Solr
http://www.rondhuit-demo.com/loganalyzer/loganalyzer.html
http://www.rondhuit.com/en/