Hi,
I had a look at the scoring equation and read the scoring online document:
http://lucene.apache.org/java/docs/scoring.html#Scoring
It is clear to me how the scoring equation would work for a query that
contains AND:
http://hudson.zones.apache.org/hudson/job/Lucene-trunk/javadoc//org/apache
On Mar 9, 2008, at 12:39 PM, Ghinwa Choueiter wrote:
but what exactly happens when there are OR's, for eg. (life OR
place OR time)
The scoring equation can get a score for life, place, time
separately, but what does it do with them then? Does it also add them.
The coord factor kicks in t
but shouldn't the coord factor kick in with AND instead of OR? I understand
why you would want to use coord in the case of AND, where you reward more
the documents that contain most of the terms in the query. However in the
case of OR, it should not matter if all the OR operands are in the
doc
I have been trying to understand all of this better myself, so while I
am no expert, here is my take:
Lucene is really a combined Vector Space / Boolean Model search engine.
At its core, Lucene is essentially a Vector Space Model search engine:
scoring is done by comparing a query term vector
With AND, _all_ clauses are required, not just most. With OR, the
idea is to reward documents that match more clauses.
Erik
On Mar 9, 2008, at 1:38 PM, Ghinwa Choueiter wrote:
but shouldn't the coord factor kick in with AND instead of OR? I
understand why you would want to use coo