Hi all,

I have just finished the implementation of Russian stemming algorithm (described at 
http://snowball.sourceforge.net/russian/stemmer.html). Today it has passed all  tests 
on a sample Russian vocabulary of almost 50,000 words 
(http://snowball.sourceforge.net/russian/voc.txt), i.e. all stems generated for this 
vocabulary, are matched to Snowball's stems 
(http://snowball.sourceforge.net/russian/output.txt). The stemmer supports Russian 
Unicode, KOI8 and Win1251 charsets. I'm planning to finish full-featured Russian 
Analyser for Lucene by the end of next week. Could you please tell me how do I 
contribute my source code to Lucene?

Thanks,
Boris Okner

Reply via email to