Ulrich Mayring wrote:
does anyone know of good stopword lists for use with Lucene? I'm interested in English and German lists.

The Snowball project has good stop lists.


See:

  http://snowball.tartarus.org/
  http://snowball.tartarus.org/english/stop.txt
  http://snowball.tartarus.org/german/stop.txt

Snowball stemmers are pre-packaged for use with Lucene at:

http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/

This project should be updated to include the Snowball stop lists too. I have not had the time to do this. This would be a great contribution if someone who is qualified has the time.

Doug


--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to