Hi,

I might be the only person on the list who's having a hard time
following this discussion. Would one of you wise folks care to point me
to a good "dummies", also known as an executive summary, resource about
the theoretical background of all of this. I understand the basic
premise of collecting the "words" and having pointers to documents and
weights, but beyond that ...

TIA,

Dror

On Fri, Nov 14, 2003 at 12:52:15PM -0500, Chong, Herb wrote:
> i don't know of any open source search engine that incorporates interterm 
> correlation. i have been looking into how to do this in Lucene and so far, it's not 
> been promising. the indexing engine and file format needs to be changed. there are 
> very few search engines that incorporate interterm correlation in any mathematically 
> and linguistically rigorous manner. i designed a couple, but they were all research 
> experiments.
> 
> if you are familiar with the TREC automatic adhoc track? my experiments with the 
> TREC-5 to TREC-7 questions produced about 0.05 to 0.10 improvement in average 
> precision by proper use of interterm correlation. my project at the time was 
> cancelled after TREC-7 and so there haven't been any new developments.
> 
> Herb....
> 
> -----Original Message-----
> From: Andrzej Bialecki [mailto:[EMAIL PROTECTED]
> Sent: Friday, November 14, 2003 12:39 PM
> To: Lucene Users List
> Subject: Re: Vector Space Model in Lucene?
> 
> Herb....
> 
> Hmm... Are you perhaps familiar with some open system which doesn't? I'm 
> curious because one of my projects (already using Lucene) could benefit 
> from such feature. Right now I'm using a bastardized version of Markov 
> chains, but it's more of a hack...
> 
> -- 
> Best regards,
> Andrzej Bialecki
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 

-- 
Dror Matalon
Zapatec Inc 
1700 MLK Way
Berkeley, CA 94709
http://www.fastbuzz.com
http://www.zapatec.com

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to