[
https://issues.apache.org/jira/browse/SOLR-379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ryan McKinley resolved SOLR-379.
--------------------------------
Resolution: Duplicate
> KStem Token Filter
> ------------------
>
> Key: SOLR-379
> URL: https://issues.apache.org/jira/browse/SOLR-379
> Project: Solr
> Issue Type: New Feature
> Components: search
> Reporter: Pieter Berkel
> Priority: Minor
> Attachments: KStemSolr.zip
>
>
> A Lucene / Solr implementation of the KStem stemmer. Full credit goes to
> Harry Wagner for adapting the Lucene version found here:
> http://ciir.cs.umass.edu/cgi-bin/downloads/downloads.cgi
> Background discussion to this stemmer (including licensing issues) can be
> found in this thread:
> http://www.nabble.com/Embedded-about-50--faster-for-indexing-tf4325720.html#a12376295
> I've made some minor changes to KStemFilterFactory so that it compiles
> cleanly against trunk:
> 1) removed some unnecessary imports
> 2) changed the init() method parameters introduced by SOLR-215
> 3) moved KStemFilterFactory into package org.apache.solr.analysis
> Once compiled and included in your Solr war (or as a jar in your lib
> directory, the KStem filter can be used in your schema very easily:
> <analyzer type="index">
> <tokenizer class="solr.StandardTokenizerFactory"/>
> <filter class="solr.StopFilterFactory" ignoreCase="true"
> words="stopwords.txt"/>
> <filter class="solr.StandardFilterFactory"/>
> <filter class="solr.LowerCaseFilterFactory"/>
> <filter class="solr.KStemFilterFactory" cacheSize="20000"/>
> <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
> </analyzer>
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]