Dear Users, I need to develop my language specific analyzer that: 1) does not remove punctuations 2) lowercases and stems each term in the text.
I have tried some of the pre-implemented language analyzer (e.g. German and Italian analyzers), but they remove punctuation. I/m not sure, but probably what I need is the whitespace analyzer instead of the standard analyzer. Is there a way to force each language specific analyzer to use the whitespace analyzer or in general not to remove punctuations? Thanks a lot! Marco