> The trailing full-stop above is not being matched when searching for
> "sage 200" for the below field type...
>
> Do I need the WordDelimiterFilterFactory for this to work as expected?
> I don't see any mention of periods being discussed in the docs.
>
> <fieldType name="textgen" class="solr.TextField" positionIncrementGap="100">
>   <analyzer type="index">
>     <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>     <filter class="solr.SynonymFilterFactory" synonyms="textgen-synonyms.txt" ignoreCase="true" expand="true"/>
>     <filter class="solr.LowerCaseFilterFactory"/>
>   </analyzer>
>   <analyzer type="query">
>     <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>     <filter class="solr.SynonymFilterFactory" synonyms="textgen-synonyms.txt" ignoreCase="true" expand="true"/>
>     <filter class="solr.LowerCaseFilterFactory"/>
>   </analyzer>
> </fieldType>
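WhitespaceTokenizer splits only on whitespace, so a trailing period stays attached to the token (for example "200." instead of "200"), and a query for "sage 200" will not match it. Either use StandardTokenizerFactory in place of the whitespace tokenizer, or add WordDelimiterFilterFactory to the analyzer chain. The Analysis page in the Solr admin UI shows the tokens produced at each stage; it is useful for testing and understanding tokenizer/filter behavior.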
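
As a rough sketch of the second option, here is the "textgen" type from the question with WordDelimiterFilterFactory added right after the tokenizer in both analyzers. The filter's position in the chain and the attribute values are my assumptions, not a tested config, so check the result on the Analysis page before reindexing.

```xml
<!-- Sketch only: filter placement and attribute values are assumptions. -->
<fieldType name="textgen" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- Drops leading/trailing delimiters, so "200." should come out as "200" -->
    <filter class="solr.WordDelimiterFilterFactory"
            generateWordParts="1" generateNumberParts="1"
            catenateWords="0" catenateNumbers="0" catenateAll="0"
            splitOnCaseChange="0"/>
    <filter class="solr.SynonymFilterFactory" synonyms="textgen-synonyms.txt" ignoreCase="true" expand="true"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- Same filter at query time so both sides produce the same tokens -->
    <filter class="solr.WordDelimiterFilterFactory"
            generateWordParts="1" generateNumberParts="1"
            catenateWords="0" catenateNumbers="0" catenateAll="0"
            splitOnCaseChange="0"/>
    <filter class="solr.SynonymFilterFactory" synonyms="textgen-synonyms.txt" ignoreCase="true" expand="true"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

Keep in mind that the word delimiter filter also splits on intra-word delimiters such as hyphens, which may change how other terms and your synonym entries match. If that is unwanted, the first option is simpler: replace solr.WhitespaceTokenizerFactory with solr.StandardTokenizerFactory in both analyzers and leave the filters as they are.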