Can anyone provide some help on writing a stemmer for non-english languages?
Have a look at the snowball project in the Lucene sandbox. If its non-European-based languages, I suspect it's quite complex. It's highly language dependent.
How proficient must I be in a language for which I wish to write the stemmer?
I would venture to say you would need to be an expert in a language to write a decent stemmer. The SnowballAnalyzer is quite hairy underneath, that's for sure.
Erik
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
