Oh sorry, please ignore my previous code snippet, my intent was:
By checking the position increment?
import org.apache.lucene.analysis.TokenFilter;
import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.tokenattributes.*;
import java.io.IOException;
import java.util.Lin
Hello,
I am trying collect stemming changes in my search index during the indexing
time. So I could collect a list of stemmed word -> [variety original word]
(e.g: plot -> [plots, plotting, plotted]) for a later use.
I am using k-stem filter + KeywordRepeatFilter
+ RemoveDuplicatesTokenFilter to