it has its limitations. that is why i am looking at what it would take to solve some of them. parsing documents to recognize sentences and storing sentence boundaries in the index would solve the ones that are most limiting. superposing interterm correlation on top of Lucene isn't very useful because then you have to build a new almost-duplicate index. since when people type in more than one word as a query, they almost invariably mean phrases, a search engine that doesn't take advantage of that isn't making use of fundamental linguistic knowledge. what has to be stored isn't much, but implementing it can be.
Herb.... -----Original Message----- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Friday, November 14, 2003 3:08 PM To: Lucene Users List Subject: Re: Vector Space Model in Lucene? I get the feeling you're looking for reasons that Lucene is inadequate. This may be the case for the uses you're speaking of, but there is quite a bit of flexibility with Lucene in terms of Analysis, scoring, and custom Query implementations that all relate to what you've been speaking of. And, of course, Lucene is a low-level component of which a higher level piece could be built around. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]