Re: a proof that every word is indexing properly

2010-12-02 Thread Erick Erickson
I'm really curious how your expert knows that the present system "indexes every word properly". You can certainly test any scenario that can be defined precisely via unit tests, as Lance suggests. Ask for *concrete* examples he's concerned with. Write tests to show that each example works. Ask for m

Re: a proof that every word is indexing properly

2010-12-02 Thread Toke Eskildsen
On Thu, 2010-12-02 at 03:54 +0100, David Linde wrote:
> Has anyone figured out a way to logically prove that lucene indexes every
> word properly?
The "Precision and recall in lucene"-thread seems relevant here.
> Our company has done a lot of research into lucene, all of our IT department
> is rea

Re: a proof that every word is indexing properly

2010-12-01 Thread Lance Norskog
This is what unit tests are for.
On Wed, Dec 1, 2010 at 6:57 PM, David Fertig wrote:
> Stop words are never indexed; you may need to empty your stop list.
>
> Luke (open-source w/code available) can browse and re-create documents
> in indexes using their terms already.  Compare that to the origin

RE: a proof that every word is indexing properly

2010-12-01 Thread David Fertig
Stop words are never indexed; you may need to empty your stop list. Luke (open-source w/code available) can browse and re-create documents in indexes using their terms already. Compare that to the original to see if you are satisfied.
-----Original Message-----
From: David Linde [mailto:davidli
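
To make the unit-test and stop-list suggestions in this thread concrete, here is a minimal sketch of the idea in Python. It does not use Lucene at all: `tokenize`, `build_index`, `missing_terms`, and the `STOP_WORDS` list are hypothetical stand-ins for an analyzer and an index, assumed only for illustration. The check re-tokenizes each source document and confirms that every non-stop word can be found in the index, which is roughly the comparison one would do by hand with Luke.

```python
import re
from collections import defaultdict

# Hypothetical stop list, for illustration only (not Lucene's default).
STOP_WORDS = frozenset({"a", "an", "and", "the", "of", "is"})


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs, stop_words=STOP_WORDS):
    """Toy inverted index: map each non-stop term to the ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            if term not in stop_words:
                index[term].add(doc_id)
    return index


def missing_terms(docs, index, stop_words=STOP_WORDS):
    """Return (doc_id, term) pairs that should be searchable but are not in the index."""
    missing = []
    for doc_id, text in docs.items():
        for term in tokenize(text):
            if term not in stop_words and doc_id not in index.get(term, set()):
                missing.append((doc_id, term))
    return missing


docs = {1: "The quick brown fox", 2: "A proof of the index"}
index = build_index(docs)

# Every non-stop word of every document should be retrievable.
assert missing_terms(docs, index) == []
# Stop words are dropped unless the stop list is emptied.
assert "the" not in index
assert "the" in build_index(docs, stop_words=frozenset())
```

A check like this only shows that the listed examples behave as expected. It is a test, not a logical proof, and against a real index the expected terms would have to come from the same analyzer configuration that was used at indexing time.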