> Can anyone tell me why this option is not provided for Stemming?

I am not sure about it, but the original token can also be preserved by placing <filter class="solr.KeywordRepeatFilterFactory"/> before the stemmer. It emits each token twice, once marked as a keyword that the stemmer leaves untouched. To avoid duplicate tokens in the document when the stemmed form is identical to the original, <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> can be used at the end of the analysis chain.
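A minimal sketch of such a chain as a Solr field type, in case it makes the ordering clearer. The field type name text_en_keep_original is made up, and the tokenizer and lowercase filter are only assumptions to make the fragment complete; the part that matters is that KeywordRepeatFilterFactory comes before the stemmer and RemoveDuplicatesTokenFilterFactory comes last.

```xml
<!-- Hypothetical field type name; tokenizer and lowercase filter are assumed, adjust to your schema. -->
<fieldType name="text_en_keep_original" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <!-- Emits every token twice: one copy flagged as a keyword, one copy unflagged. -->
    <filter class="solr.KeywordRepeatFilterFactory"/>
    <!-- Stems only the unflagged copy; the keyword copy keeps its original form. -->
    <filter class="solr.EnglishMinimalStemFilterFactory"/>
    <!-- Drops the second copy when stemming did not change the token. -->
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  </analyzer>
</fieldType>
```

With a chain like this, a token such as *Methods* should be indexed as both *methods* and *method* at the same position, while a token the stemmer does not change is indexed only once.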
Hope this helps.

Regards,
Modassar

On Tue, Aug 25, 2015 at 2:12 PM, Vishnu Mishra <vdil...@gmail.com> wrote:

> Hi,
>
> I was working with Lucene 5.2 and trying to index some documents. I am using
> EnglishMinimalStemFilterFactory and I found that there is no option for
> keeping the original text as well as the analyzed term in the Lucene index.
> WordDelimiterFilterFactory provides a preserveOriginal option to do this.
> Can anyone tell me why this option is not provided for stemming? For example,
> if I want to store both *Methods* and *Method* in my index, then I think
> there is no option available in Lucene to do this. I also noticed that if we
> place EnglishMinimalStemFilterFactory after WordDelimiterFilterFactory with
> the option preserveOriginal="1", then it stores both *Methods* and *Method*.