Erik Hatcher wrote:
> On Oct 29, 2004, at 10:39 AM, Murray Altheim wrote:
>
>> In short, I *think* I'm using a newer version of the code than the
>> one in the repository, plus I've cleaned it up.
>
> Can you provide some tests that show differences in how it stems between yours and the built-in one?

Certainly, but the test I used is identical to the one that Martin Porter provides: an input file and an output file. Running my modified version on the input file produces output identical to the provided output file (which is just a long list of words). I made no algorithmic changes to the code, only formatting and syntax changes to better conform to Java coding guidelines, plus the aforementioned predeclaration of final Strings, which affects only performance.

The test files are identical to the ones on Martin Porter's web
page:

   http://www.tartarus.org/~martin/PorterStemmer/index.html
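
For anyone who wants to repeat the comparison, here is a minimal sketch of that kind of check in current Java. It is not the code I ran: the class name StemmerRegressionCheck and the method mismatches are hypothetical, and the stemmer is passed in as a plain function so that any implementation (the modified one or the built-in one) can be plugged in.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.UnaryOperator;

/** Hypothetical harness: compares a stemmer's output with an expected word list. */
public class StemmerRegressionCheck {

    /**
     * Returns one line per mismatch, formatted "word: expected X, got Y".
     * An empty result means the stemmer reproduced the expected output.
     */
    public static List<String> mismatches(List<String> input,
                                          List<String> expected,
                                          UnaryOperator<String> stemmer) {
        // The two files pair up line by line, so their lengths must agree.
        if (input.size() != expected.size()) {
            throw new IllegalArgumentException("input and expected lists differ in length");
        }
        List<String> result = new ArrayList<>();
        for (int i = 0; i < input.size(); i++) {
            String word = input.get(i);
            String actual = stemmer.apply(word);
            if (!expected.get(i).equals(actual)) {
                result.add(word + ": expected " + expected.get(i) + ", got " + actual);
            }
        }
        return result;
    }
}
```

The two lists could be loaded with java.nio.file.Files.readAllLines from local copies of the input and output files, and the third argument would be a small wrapper around whichever stemmer is under test. Running it once per stemmer and comparing the results would show any difference between the two.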

> The PorterStemFilter is not used by any built-in Analyzers, so I actually think we should move it out to the Analyzers Sandbox area or deprecate it in favor of the Snowball stemmer. Thoughts?

None. As I mentioned, I'm new to this project and am not familiar with the advantages of the Snowball stemmer. In reading through the pages on SourceForge, e.g.,

   http://snowball.tartarus.org/texts/introduction.html

there are apparently pros and cons. But for myself, I'd leave it up
to those with more history in this project to make these kinds of
decisions.

Murray

......................................................................
Murray Altheim                    http://kmi.open.ac.uk/people/murray/
Knowledge Media Institute
The Open University, Milton Keynes, Bucks, MK7 6AA, UK               .

   "[International terrorism] is a fantasy that has been exaggerated
   and distorted by politicians. It is a dark illusion that has
   spread unquestioned through governments around the world, the
   security services, and the international media. In an age when
   all the grand ideas have lost credibility, fear of a phantom
   enemy is all the politicians have left to maintain their power."

   The making of the terror myth, The Guardian
   http://www.guardian.co.uk/terrorism/story/0,12780,1327904,00.html
