Hi guys, I am a Lucene.Net user, but I got no replies there, so I decided to try here in the hope that someone has run into the same problem.
I have a problem with RussianStemmer. We are trying to use it with the Snowball analyzer, and it does not work as expected: it seems to do nothing at all, i.e. none of the stemming that would, for example, turn "dogs" into "dog". Perhaps I have an encoding problem? I looked at the source code of RussianStemmer and saw code like a_0 = new Among[]{new Among("\u00D7\u00DB\u00C9", and so on. It looks like Unicode, which is presumably how the Russian text is represented, so I tried a few conversions on my Russian text before sending it to the indexer (UTF8ToUnicode, etc.), but that did not help. Could anybody help me with this?

Maxim