Hi Jakub,

I have ported the KStemmer for use in most recent Solr trunk version.
My stemmer is located in the lib directory of Solr "solr/lib/KStemmer-2.00.jar"
because it belongs to Solr.

Write it as FilterFactory and use it as Filter like:
<filter class="de.ubbielefeld.solr.analysis.KStemFilterFactory" 
protected="protwords.txt" />

This is how my fieldType looks like:

    <fieldType name="text_kstem" class="solr.TextField" 
positionIncrementGap="100">
      <analyzer type="index">
        <tokenizer class="solr.WhitespaceTokenizerFactory" />
        <filter class="solr.StopFilterFactory" ignoreCase="true" 
words="stopwords.txt" enablePositionIncrements="false" />
        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" 
generateNumberParts="1" catenateWords="1" catenateNumbers="1"
catenateAll="0" splitOnCaseChange="1" />
        <filter class="solr.LowerCaseFilterFactory" />
        <filter class="de.ubbielefeld.solr.analysis.KStemFilterFactory" 
protected="protwords.txt" />
        <filter class="solr.RemoveDuplicatesTokenFilterFactory" />
      </analyzer>
      <analyzer type="query">
        <tokenizer class="solr.WhitespaceTokenizerFactory" />
        <filter class="solr.StopFilterFactory" ignoreCase="true" 
words="stopwords.txt" />
        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" 
generateNumberParts="1" catenateWords="0" catenateNumbers="0"
catenateAll="0" splitOnCaseChange="1" />
        <filter class="solr.LowerCaseFilterFactory" />
        <filter class="de.ubbielefeld.solr.analysis.KStemFilterFactory" 
protected="protwords.txt" />
        <filter class="solr.RemoveDuplicatesTokenFilterFactory" />
      </analyzer>
    </fieldType>

Regards,
Bernd



Am 28.10.2010 14:56, schrieb Jakub Godawa:
> Hi!
> There is a polish stemmer http://www.getopt.org/stempel/ and I have
> problems connecting it with solr 1.4.1
> Questions:
> 
> 1. Where EXACTLY do I put "stemper-1.0.jar" file?
> 2. How do I register the file, so I can build a fieldType like:
> 
> <fieldType name="text_pl" class="solr.TextField">
>   <analyzer class="org.geoopt.solr.analysis.StempelTokenFilterFactory"/>
> </fieldType>
> 
> 3. Is that the right approach to make it work?
> 
> Thanks for verbose explanation,
> Jakub.

Reply via email to