Re: Collecting all stemming token

2017-02-03 Thread Xiaolong Zheng
Oh sorry, please ignore my previous code snippet, my intent was: By checking the position increment? import org.apache.lucene.analysis.TokenFilter; import org.apache.lucene.analysis.TokenStream; import org.apache.lucene.analysis.tokenattributes.*; import java.io.IOException; import java.util.Lin

Collecting all stemming token

2017-02-03 Thread Xiaolong Zheng
Hello, I am trying collect stemming changes in my search index during the indexing time. So I could collect a list of stemmed word -> [variety original word] (e.g: plot -> [plots, plotting, plotted]) for a later use. I am using k-stem filter + KeywordRepeatFilter + RemoveDuplicatesTokenFilter to