Dear Wiki user, You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.
The following page has been changed by HarryWagner: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters/Kstem ------------------------------------------------------------------------------ 1. Download [#http://ciir.cs.umass.edu/cgi-bin/downloads/downloads.cgi KStem] 2. Unpack the jar file 3. Modify the package name on the source files to match your install - 4. Rename KStemFilter.java to KStemFilterFactory.java and modify as follows (apparently there are licensing issues that prevent this code from being included in SOlr or available as a download): + 4. Replace KStemFilter.java with KStemFilterFactory.java (see source listing below. Apparently there are licensing issues that prevent this code from being included in Solr or available as a download) + 5. Build the jar file and drop that into your Solr /lib directory + 6. Modify your schema as follows: + == Schema Changes == + {{{ + <fieldtype name="text_kstem" class="solr.TextField"> + <analyzer type="index"> + <tokenizer class="solr.WhitespaceTokenizerFactory"/> + <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/> + <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0"/> + <filter class="solr.LowerCaseFilterFactory"/> + <filter class="org.yourOrgHere.solr.analysis.KStemFilterFactory" cacheSize="20000"/> + <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> + </analyzer> + <analyzer type="query"> + <tokenizer class="solr.WhitespaceTokenizerFactory"/> + <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/> + <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/> + <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0"/> + <filter class="solr.LowerCaseFilterFactory"/> + <filter class="org.yourOrgHere.solr.analysis.KStemFilterFactory" cacheSize="20000"/> + <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> + </analyzer> + </fieldtype> + }}} + + == KStemFilterFactory == {{{#!java /* Copyright 2003, @@ -43, +69 @@ Modified for Solr use: H. Wagner, OCLC 2007-09-29 */ - package your.package.name.here; + package package org.yourOrgHere.solr.analysis;; /** * <p>Title: </p>
