Hi guys,
I am a Lucene.Net user but I got no replies from there so I decided to try 
here, hoping that someone here encountered the same problem.

I got a problem with RussianStemmer. We try to use it with Snowball analyzer 
and it just won't work as expected. It seems that it just don't do anything , 
like transfer "dogs" to "dog", etc. 

Perhaps I have some problem with the encoding?
I looked at the source code of RussianStemmer and I see 

a_0 = new Among[]{new Among("\u00D7\u00DB\u00C9",
kind of code. It looks like Unicode, which probably what Russian is represented 
like so I tried some games with my Russian text before sending it to the 
indexing (UTF8ToUnicode, etc..) but it didn't do any good.  

Anybody could help me with that?

 
Maxim

Reply via email to