This looks good.  It is low-risk, since it changes no existing classes,
adding new classes in a new package.  It looks as though you've implemented
the right things (a stemming filter, and an analyzer that plugs this
together with a stop list and tokenizer).  I don't speak Dutch, so I cannot
assess the quality of these.

My concerns before this is checked in are that: 
  1. It compiles.  I have not tested this yet, and we don't want to break
the build.
  2. More Javadoc is added.  My rule of thumb is that there should be no
empty boxes in the generated javadoc, i.e., every publicly visible class,
method and field must be documented.  Also, every package should have a
package.html with at least a one-line description of what is in the package.
  3. Fewer classes and methods are public.  Do the Rule, RuleVector and
DutchStemmer classes need to be public?  Probably little more than the core
classes (DutchStemFilter DutchAnalyzer) need to be public.

Doug

> -----Original Message-----
> From: Maurits van Wijland [mailto:[EMAIL PROTECTED]]
> Sent: Wednesday, October 10, 2001 12:48 AM
> To: [EMAIL PROTECTED]
> Cc: [EMAIL PROTECTED]; Doug Cutting
> Subject: Dutch Stemmer
> 
> 
> Doug, 
> 
> Here is my first version of the dutch stemmer. 
> Doug could you have a look at this?
> 
> kind regards,
> 
> Maurits
> 
> 
> 

Reply via email to