Jan,

I looked in a few places for you but had no luck. In the meantime, I
found this document, which you might find useful:
http://www.edbt2006.de/edbt-share/IntroductionToTextMining.pdf

Yagmur Coker

On Nov 29, 2007 12:27 PM, Jan Philipp Seng <[EMAIL PROTECTED]> wrote:

> Hello Everyone,
>
> Stemming German words is not very accurate with the stemmers in Lucene,
> because the German language is complicated and has too many special cases
> to capture in an algorithm. So I have decided to add a lookup map to a
> stemmer that contains stems for the most frequent German words that the
> stemmer handles incorrectly. I have searched the web and could not find a
> word list of German stems (or you have to pay EUR 10,000 or more for it).
> I have found a nice list of English stems
> (http://www.comp.lancs.ac.uk/computing/research/stemming/Links/resources.htm),
> but not one for German. Does anybody know where I can find a list like
> this, or do I have to do all the work on my own?
>
> Greetings,
>
> Jan Philipp Seng
>
>
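
The lookup-map idea in the quoted message can be sketched roughly as below.
This is only an illustration, written in Python rather than against the
Lucene API. The class name LookupStemmer, the toy naive_suffix_stemmer
fallback, and the four exception entries are all made up for the example;
a real setup would plug in an actual algorithmic stemmer and a curated
word list.

```python
from typing import Callable, Dict


def naive_suffix_stemmer(word: str) -> str:
    """Hypothetical stand-in for an algorithmic stemmer.

    It only strips a few common suffixes and is NOT a real German stemmer.
    """
    for suffix in ("ern", "en", "er", "es", "e", "n", "s"):
        # Keep at least three characters so short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


class LookupStemmer:
    """Consults an exception map first, then falls back to an algorithm."""

    def __init__(self, exceptions: Dict[str, str],
                 fallback: Callable[[str], str]) -> None:
        # Normalise keys once so lookups are case-insensitive.
        self._exceptions = {k.lower(): v for k, v in exceptions.items()}
        self._fallback = fallback

    def stem(self, word: str) -> str:
        key = word.lower()
        if key in self._exceptions:
            return self._exceptions[key]
        return self._fallback(key)


# Illustrative entries only; not taken from any published word list.
EXCEPTIONS = {
    "häuser": "haus",
    "bücher": "buch",
    "ging": "geh",
    "gegangen": "geh",
}

stemmer = LookupStemmer(EXCEPTIONS, naive_suffix_stemmer)
```

With this arrangement the map only needs entries for words the algorithm
gets wrong, so it can start small and grow as bad stems are noticed.
Anything not listed still goes through the fallback unchanged.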
