Hi Jakub, I have ported the KStemmer for use in most recent Solr trunk version. My stemmer is located in the lib directory of Solr "solr/lib/KStemmer-2.00.jar" because it belongs to Solr.
Write it as FilterFactory and use it as Filter like: <filter class="de.ubbielefeld.solr.analysis.KStemFilterFactory" protected="protwords.txt" /> This is how my fieldType looks like: <fieldType name="text_kstem" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <tokenizer class="solr.WhitespaceTokenizerFactory" /> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="false" /> <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="1" /> <filter class="solr.LowerCaseFilterFactory" /> <filter class="de.ubbielefeld.solr.analysis.KStemFilterFactory" protected="protwords.txt" /> <filter class="solr.RemoveDuplicatesTokenFilterFactory" /> </analyzer> <analyzer type="query"> <tokenizer class="solr.WhitespaceTokenizerFactory" /> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" /> <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0" splitOnCaseChange="1" /> <filter class="solr.LowerCaseFilterFactory" /> <filter class="de.ubbielefeld.solr.analysis.KStemFilterFactory" protected="protwords.txt" /> <filter class="solr.RemoveDuplicatesTokenFilterFactory" /> </analyzer> </fieldType> Regards, Bernd Am 28.10.2010 14:56, schrieb Jakub Godawa: > Hi! > There is a polish stemmer http://www.getopt.org/stempel/ and I have > problems connecting it with solr 1.4.1 > Questions: > > 1. Where EXACTLY do I put "stemper-1.0.jar" file? > 2. How do I register the file, so I can build a fieldType like: > > <fieldType name="text_pl" class="solr.TextField"> > <analyzer class="org.geoopt.solr.analysis.StempelTokenFilterFactory"/> > </fieldType> > > 3. Is that the right approach to make it work? > > Thanks for verbose explanation, > Jakub.