Scoring a query with OR's

2008-03-09 Thread Ghinwa Choueiter
Hi, I had a look at the scoring equation and read the scoring online document: http://lucene.apache.org/java/docs/scoring.html#Scoring It is clear to me how the scoring equation would work for a query that contains AND: http://hudson.zones.apache.org/hudson/job/Lucene-trunk/javadoc//org/apache

Re: Scoring a query with OR's

2008-03-09 Thread Erik Hatcher
On Mar 9, 2008, at 12:39 PM, Ghinwa Choueiter wrote: but what exactly happens when there are OR's, for eg. (life OR place OR time) The scoring equation can get a score for life, place, time separately, but what does it do with them then? Does it also add them. The coord factor kicks in t

Re: Scoring a query with OR's

2008-03-09 Thread Ghinwa Choueiter
but shouldn't the coord factor kick in with AND instead of OR? I understand why you would want to use coord in the case of AND, where you reward more the documents that contain most of the terms in the query. However in the case of OR, it should not matter if all the OR operands are in the doc

Re: Scoring a query with OR's

2008-03-09 Thread Mark Miller
I have been trying to understand all of this better myself, so while I am no expert, here is my take: Lucene is really a combined Vector Space / Boolean Model search engine. At its core, Lucene is essentially a Vector Space Model search engine: scoring is done by comparing a query term vector

Re: Scoring a query with OR's

2008-03-09 Thread Erik Hatcher
With AND, _all_ clauses are required, not just most. With OR, the idea is to reward documents that match more clauses. Erik On Mar 9, 2008, at 1:38 PM, Ghinwa Choueiter wrote: but shouldn't the coord factor kick in with AND instead of OR? I understand why you would want to use coo