The PorterStemmer class derives from Martin Porter's original
implementation, with some bugs fixed (by me). I deliberately kept the
modifications minimal to simplify merging with future versions of the
Porter stemmer (of which there were none).

I joined the mailing list just this week, and on this topic thought
I'd mention that I've rewritten the PorterStemmer Java class, cleaning
up whitespace and predeclaring all the Strings for better performance.
It passes the file-in file-out test provided by Martin Porter (in
other words, no change in behavior from the existing algorithm). The
source for mine was taken from his site -- I'm not sure of the origin
of the one in Lucene. I could also add an Apache license header to
the top.
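
For anyone who wants to repeat that kind of check, here is a rough
sketch of a file-in file-out comparison: each input word is stemmed
and compared with the matching line of the expected output. The class
name StemmerRegressionCheck, the method findMismatches, and the toy
stemmer are all made up for illustration; the toy stemmer is only a
stand-in so the example runs by itself and is not the Porter
algorithm.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.UnaryOperator;

public class StemmerRegressionCheck {

    // Compares stemmer output against the expected output, line by line.
    // Returns one message per mismatching line; an empty list means a pass.
    static List<String> findMismatches(List<String> words,
                                       List<String> expectedStems,
                                       UnaryOperator<String> stemmer) {
        if (words.size() != expectedStems.size()) {
            throw new IllegalArgumentException(
                "input has " + words.size() + " lines, expected output has "
                + expectedStems.size());
        }
        List<String> mismatches = new ArrayList<>();
        for (int i = 0; i < words.size(); i++) {
            String actual = stemmer.apply(words.get(i));
            if (!actual.equals(expectedStems.get(i))) {
                mismatches.add("line " + (i + 1) + ": " + words.get(i)
                    + " -> " + actual
                    + " (expected " + expectedStems.get(i) + ")");
            }
        }
        return mismatches;
    }

    public static void main(String[] args) {
        // Toy stand-in, NOT the Porter algorithm: strips one trailing "s".
        UnaryOperator<String> toyStemmer =
            w -> w.endsWith("s") ? w.substring(0, w.length() - 1) : w;

        // In a real run these two lists would come from the input file
        // and the expected-output file, one word per line.
        List<String> words = Arrays.asList("cats", "dog", "stems");
        List<String> expected = Arrays.asList("cat", "dog", "stem");

        List<String> mismatches = findMismatches(words, expected, toyStemmer);
        System.out.println(mismatches.isEmpty() ? "PASS" : "FAIL: " + mismatches);
    }
}
```

To run it against real data, the two lists could be loaded with
java.nio.file.Files.readAllLines and the toy lambda swapped for a call
into whichever stemmer class is under test.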