Hello,

> >   - Added a few more words to the stop word list (MS' 
> > contribution via Alan).
> 
> I don't think we should do that here.  This could break any
> application
> which is already using this stop list when it upgrades Lucene, since
> it will
> no longer be possible to search for these words.

Ah, didn't think of that.  I'll revert to the previous revision.

> What we need is a facility
> to load stop lists from file-based resources, and to include a new
> such
> resource that contains this MS stop list.  But I don't think we
> should
> change the default stop lists.  What do others think?  Is that too
> conservative?

Couldn't applications already use this:

  /** Builds an analyzer with the given stop words. */
  public StandardAnalyzer(String[] stopWords) {
    stopTable = StopFilter.makeStopTable(stopWords);
  }

And we leave it up to them to figure out how they get the list of stop
words to this constructor.

> >   - Re-indented the whole class.
> 
> I don't think we should re-indent whole files.  It makes it hard to
> figure
> out what's changed over time.  We should try to use a similar
> indenting
> style, but if someone has written the code, they have the right to
> indent
> it.  I generally only re-indent code if I'm committing it for the
> first
> time, or for those parts of the code that I change.

I knew I was walking on thin ice when I did this. :)
I was thinking more along the lines of making all code in the project
uniform, which may be different from the original.
I prefer the former, but either one is fine with me, as long as it's
clear what we're sticking to.  I think the former would not be hard to
achieve since we have a fairly small number of developers with commit
priviledges.

Otis


__________________________________________________
Do You Yahoo!?
Yahoo! Sports - Coverage of the 2002 Olympic Games
http://sports.yahoo.com

--
To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Reply via email to