Re: stemming in Lucene

2008-04-15 Thread Hannu Väisänen
Wojtek H wrote: >Snowball stemmers are part of Lucene, but for few languages only >But maybe there is a better way or there are people working on >something like that? I use Malaga (http://home.arcor.de/bjoern-beutel/malaga/) for lemmatization and index the result. http://joyds1.joensuu.fi/progra

Re: stemming in Lucene

2008-04-02 Thread Mathieu Lecarme
Wojtek H a écrit : Hi all, Snowball stemmers are part of Lucene, but for few languages only. We have documents in various languages and so need stemmers for many languages (in particular polish). One of the ideas is to use ispell dictionaries. There are ispell dicts for many languages and so thi

Re: stemming in Lucene

2008-04-01 Thread Karl Wettin
Wojtek H skrev: Snowball stemmers are part of Lucene, but for few languages only. We org.apache.lucene.analysis contains a few more stemmers. have documents in various languages and so need stemmers for many languages (in particular polish). Have you seen Stempel? http://www.getopt.org/ste

stemming in Lucene

2008-04-01 Thread Wojtek H
Hi all, Snowball stemmers are part of Lucene, but for few languages only. We have documents in various languages and so need stemmers for many languages (in particular polish). One of the ideas is to use ispell dictionaries. There are ispell dicts for many languages and so this solution is good fo