On Jun 3, 2004, at 4:09 PM, Musku, Anil (LA) wrote:
Can anyone provide some help on writing a stemmer for non-English languages?

Have a look at the Snowball project in the Lucene sandbox. If you are targeting non-European languages, I suspect it's quite complex. Stemming is highly language dependent.


How proficient must I be in a language for which I wish to write the stemmer?

I would venture to say you would need to be an expert in a language to write a decent stemmer. The SnowballAnalyzer is quite hairy underneath, that's for sure.
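
To give a rough feel for why this is so language dependent, here is a toy suffix-stripping sketch in plain Java. It is not the Snowball algorithm and does not use the Lucene API; the class name ToyStemmer, the suffix list, and the minimum stem length are all made up for illustration, and the rules only make (partial) sense for English.

```java
import java.util.Locale;

// Toy example only: a naive English suffix stripper, NOT the Snowball algorithm.
final class ToyStemmer {
    // Hypothetical suffix list; longer suffixes come first so "es" is tried before "s".
    private static final String[] SUFFIXES = {"ing", "ed", "es", "s"};

    // Assumed lower bound so that short words such as "is" are left alone.
    private static final int MIN_STEM_LENGTH = 3;

    // Strips the first matching suffix, provided enough of the word remains.
    static String stem(String word) {
        String w = word.toLowerCase(Locale.ROOT);
        for (String suffix : SUFFIXES) {
            int stemLength = w.length() - suffix.length();
            if (w.endsWith(suffix) && stemLength >= MIN_STEM_LENGTH) {
                return w.substring(0, stemLength);
            }
        }
        return w;
    }

    public static void main(String[] args) {
        String[] words = {"walking", "walked", "walks", "running"};
        for (String word : words) {
            System.out.println(word + " -> " + stem(word));
        }
    }
}
```

Even for English this breaks quickly: "running" comes out as "runn", because undoing the doubled consonant needs another rule. Every such rule encodes knowledge of one language's morphology, which is why you need to know the language well.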


        Erik

