Jen, I looked up a few places for you but didn't get lucky. In the meantime, I found this document that you might find resourceful. http://www.edbt2006.de/edbt-share/IntroductionToTextMining.pdf
Yagmur Coker On Nov 29, 2007 12:27 PM, Jan Philipp Seng <[EMAIL PROTECTED]> wrote: > Hello Everyone, > > > > stemming german words is not very accurate with the stemmers in Lucene. > This > is because the german language is complicated and so many special cases > cant > be put into a algorithm. So I have decided to add a lookup map to a > stemmer, > that contains stems for the most frequent german words, where the stemmer > does not work correct. I have searched the web and could not find a word > list for german stems (or you have to pay for it EUR 10.000 or more). I > have > found a nice list with english stems > ( > http://www.comp.lancs.ac.uk/computing/research/stemming/Links/resources.htm > ), but not for german. Does anybody know where I can find a list like this > or do I have to do all the work on my own??? > > > > Greetings, > > Jan Philipp Seng > >
