Re: Lucene scoring: coord_q_d factor

2006-12-19 Thread Doug Cutting
Karl Koch wrote: Are there any other papers that regard the combination of coordination level matching and TFxIDF as advantageous? We independently developed coordination-level matching combined with TFxIDF when I worked at Apple. This is documented in: http://www.informatik.uni-trier.de/~

Re: Lucene scoring: coord_q_d factor

2006-12-14 Thread Grant Ingersoll
FYI: The Wiki has a fair number of resources on IR: http:// wiki.apache.org/jakarta-lucene/InformationRetrieval (I have added a link to this conversation, which contains a lot of useful information) Karl, if you are so inclined, please feel free to add any of the references you have found t

Re: Lucene scoring: coord_q_d factor

2006-12-14 Thread Soeren Pekrul
Soeren Pekrul wrote: The score for a document is the sum of the term weights w(tf, idf) for each containing term. So you have already the combination of coordination level matching with IDF. Now it is possible that your query requests three terms A, B and C. Two of them (A and B) are quite ofte

Re: Lucene scoring: coord_q_d factor

2006-12-14 Thread Karl Koch
: java-user@lucene.apache.org Betreff: Re: Lucene scoring: coord_q_d factor > Karl Koch wrote: > > If I do not misunderstand that extract, I would say it suggests the > combination of coordination level matching with IDF. I am interested in your > view and those who read this? &

Re: Lucene scoring: coord_q_d factor

2006-12-14 Thread Soeren Pekrul
Karl Koch wrote: If I do not misunderstand that extract, I would say it suggests the combination of coordination level matching with IDF. I am interested in your view and those who read this? I understand that sentence: "The natural solution is to correlate a term's matching value with its co

Re: Lucene scoring: coord_q_d factor

2006-12-13 Thread Otis Gospodnetic
:35:55 PM Subject: Re: Lucene scoring: coord_q_d factor Hello Paul, thank you for providing the link to that paper. I read it again, and you are right. I discovered the following text part: "In normal term co-ordination matches, if a request and document have a frequent term in common, th

Re: Lucene scoring: coord_q_d factor

2006-12-13 Thread Karl Koch
her papers that regard the combination of coordination level matching and TFxIDF as advantageous? Cheers, Karl Original-Nachricht Datum: Wed, 13 Dec 2006 21:00:45 +0100 Von: Paul Elschot <[EMAIL PROTECTED]> An: java-user@lucene.apache.org Betreff: Re: Lucene scoring: c

Re: Lucene scoring: coord_q_d factor

2006-12-13 Thread Paul Elschot
On Wednesday 13 December 2006 16:42, Karl Koch wrote: > Do you know about any papers that discuss this? Coordination is called co-ordination In the original idf paper by K. Spärck Jones, A statistical interpretation of term specificity and its application in retrieval., Journal of Documentation 2

Re: Lucene scoring: coord_q_d factor

2006-12-13 Thread Karl Koch
Do you know about any papers that discuss this? Karl Original-Nachricht Datum: Wed, 13 Dec 2006 10:31:41 -0500 Von: "Yonik Seeley" <[EMAIL PROTECTED]> An: java-user@lucene.apache.org Betreff: Re: Lucene scoring: coord_q_d factor > On 12/13/06, Karl Koc

Re: Lucene scoring: coord_q_d factor

2006-12-13 Thread Yonik Seeley
On 12/13/06, Karl Koch <[EMAIL PROTECTED]> wrote: To me, it seems that coordination level matching could be used if I don't want to use TFxIDF but not together with it. In this context, I wonder what benefit the "coordination level matching" has in combination with TFxIDF? Well, if I search f

Re: Lucene scoring: coord_q_d factor

2006-12-13 Thread Karl Koch
ble explaination (perhaps augmented with some evidence in literature) that makes it clear why it is used together with TFxIDF. Thank you, Karl Original-Nachricht Datum: Tue, 12 Dec 2006 17:15:48 -0500 Von: Steven Rowe <[EMAIL PROTECTED]> An: java-user@lucene.apache.org

Re: Lucene scoring: coord_q_d factor

2006-12-12 Thread Steven Rowe
Karl Koch wrote: > Is there any other paper that actually shows the benefit of doing > this particular normalisation with coord_q_d? I am not suggesting > here that it is not useful, I am just looking for evidence how the > idea developed. I think it's a mischaracterization to call coordination a

Re: Lucene scoring: coord_q_d factor

2006-12-12 Thread Karl Koch
: java-user@lucene.apache.org Betreff: Re: Lucene scoring: coord_q_d factor > Karl Koch wrote: > > The coord(q,d) normalisation is "a score factor based on how many of > > the query terms are found in the specified document." and described > > here: > > >

Re: Lucene scoring: coord_q_d factor

2006-12-12 Thread Steven Rowe
Karl Koch wrote: > The coord(q,d) normalisation is "a score factor based on how many of > the query terms are found in the specified document." and described > here: > > http://lucene.apache.org/java/docs/api/org/apache/lucene/search/Similarity.html#formula_coord > > Does this have a theoretical