The following page has been changed by HarryWagner:
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters/Kstem

------------------------------------------------------------------------------
   1. Download [http://ciir.cs.umass.edu/cgi-bin/downloads/downloads.cgi KStem]
   2. Unpack the jar file
   3. Modify the package name on the source files to match your install
-  4. Rename KStemFilter.java to KStemFilterFactory.java and modify as follows 
(apparently there are licensing issues that prevent this code from being 
included in SOlr or available as a download):
+  4. Replace KStemFilter.java with KStemFilterFactory.java (see the source listing below). Licensing issues apparently prevent this code from being included in Solr or offered as a download.
+  5. Build the jar file and drop it into your Solr /lib directory
+  6. Modify your schema as follows:
  
+ == Schema Changes ==
+ {{{
+ <fieldtype name="text_kstem" class="solr.TextField">
+       <analyzer type="index">
+         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
+         <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
+         <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0"/>
+         <filter class="solr.LowerCaseFilterFactory"/>
+         <filter class="org.yourOrgHere.solr.analysis.KStemFilterFactory" cacheSize="20000"/>
+         <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
+       </analyzer>
+       <analyzer type="query">
+         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
+         <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
+         <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
+         <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0"/>
+         <filter class="solr.LowerCaseFilterFactory"/>
+         <filter class="org.yourOrgHere.solr.analysis.KStemFilterFactory" cacheSize="20000"/>
+         <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
+       </analyzer>
+ </fieldtype>
+ }}}
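
Before restarting Solr with the new field type, it can help to confirm that the edited schema fragment is well-formed and that the KStem filter sits where you expect in both analyzer chains. The following is a minimal sketch using only the JDK's XML parser. The class name `SchemaFilterChain`, the method `filterChain`, and the trimmed `SAMPLE` string are illustrative and not part of Solr or of the wiki page; the ordering check simply mirrors the listing above (KStem after `solr.LowerCaseFilterFactory`) and is an assumption about what you want to verify, not a documented Solr requirement.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class SchemaFilterChain {

    static final String KSTEM = "org.yourOrgHere.solr.analysis.KStemFilterFactory";
    static final String LOWER = "solr.LowerCaseFilterFactory";

    // Trimmed copy of the text_kstem field type (most filter attributes are
    // omitted, and single quotes are used to avoid escaping in Java).
    public static final String SAMPLE =
          "<fieldtype name='text_kstem' class='solr.TextField'>"
        + "<analyzer type='index'>"
        + "<tokenizer class='solr.WhitespaceTokenizerFactory'/>"
        + "<filter class='solr.StopFilterFactory'/>"
        + "<filter class='solr.WordDelimiterFilterFactory'/>"
        + "<filter class='" + LOWER + "'/>"
        + "<filter class='" + KSTEM + "' cacheSize='20000'/>"
        + "<filter class='solr.RemoveDuplicatesTokenFilterFactory'/>"
        + "</analyzer>"
        + "<analyzer type='query'>"
        + "<tokenizer class='solr.WhitespaceTokenizerFactory'/>"
        + "<filter class='solr.SynonymFilterFactory'/>"
        + "<filter class='solr.StopFilterFactory'/>"
        + "<filter class='solr.WordDelimiterFilterFactory'/>"
        + "<filter class='" + LOWER + "'/>"
        + "<filter class='" + KSTEM + "' cacheSize='20000'/>"
        + "<filter class='solr.RemoveDuplicatesTokenFilterFactory'/>"
        + "</analyzer>"
        + "</fieldtype>";

    /**
     * Parses a field type fragment and returns the filter class names, in
     * document order, for the analyzer whose type attribute matches
     * analyzerType ("index" or "query"). Throws if the XML is not well-formed.
     */
    public static List<String> filterChain(String fieldTypeXml, String analyzerType)
            throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(fieldTypeXml)));
        List<String> classes = new ArrayList<>();
        NodeList analyzers = doc.getElementsByTagName("analyzer");
        for (int i = 0; i < analyzers.getLength(); i++) {
            Element analyzer = (Element) analyzers.item(i);
            if (!analyzerType.equals(analyzer.getAttribute("type"))) {
                continue;
            }
            NodeList filters = analyzer.getElementsByTagName("filter");
            for (int j = 0; j < filters.getLength(); j++) {
                classes.add(((Element) filters.item(j)).getAttribute("class"));
            }
        }
        return classes;
    }

    public static void main(String[] args) throws Exception {
        for (String type : new String[] {"index", "query"}) {
            List<String> chain = filterChain(SAMPLE, type);
            System.out.println(type + ": " + chain);
            int lower = chain.indexOf(LOWER);
            int kstem = chain.indexOf(KSTEM);
            // Mirrors the ordering used in the listing above.
            if (lower < 0 || kstem < lower) {
                throw new IllegalStateException(
                        "KStem filter missing or before lowercase in " + type + " analyzer");
            }
        }
    }
}
```

To check your real schema, read the field type fragment from your schema file into a string and pass it to `filterChain` in place of `SAMPLE`, replacing `org.yourOrgHere` with the package name you chose in step 3.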
+ 
+ == KStemFilterFactory ==
  {{{#!java
  /*
  Copyright 2003,
@@ -43, +69 @@

  
  Modified for Solr use: H. Wagner, OCLC 2007-09-29
  */
- package your.package.name.here;
+ package org.yourOrgHere.solr.analysis;
  
  /**
   * <p>Title: </p>
