Re: [MarkLogic Dev General] Query weights

Peter Hickman Tue, 08 May 2007 02:32:17 -0700

Michael Blakeley wrote:

I think the interesting point for your question is that scores are calculated based on inverse document frequency (IDF) as well as term frequency (TF). If that doesn't suit your application, you can choose an alternative scoring technique: try score-logtf, or score-simple, as options to cts:search() - http://developer.marklogic.com/pubs/3.1/apidocs/SearchBuiltins.html#search has more information.

Sorry for the lateness of the reply (stuff cropping up at home, then a bank holiday, etc etc etc). I have tried "score-logtfidf", "score-logtf", and even "score-simple". And although they change the scores they do not seem to change the ordering. The problem as I see it is that a weighting that applies to documents that matched on dc:title seems to be applied to documents that do not match the dc:title. Given,


cts:element-query(xs:QName("dc:title"),cts:word-query("bach",(),16)),
cts:element-query(xs:QName("opp:body"),cts:word-query("bach"))

in a cts:or-query should boost the score of documents that match "bach" in the dc:title element and not boost the score for documents that do not. However the examples show that the score of documents that do not have "bach" in the dc:title element are being boosted along with those that do. This is confusing and makes me feel like I have no idea as to what is going on. My understanding is that results 11 and 12 should not have been boosted, but they were. I need to know why if I am to make use of this facility.
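
For reference, here is a minimal sketch of how the two fragments above might sit inside a complete query that prints each result's score, so that the ordering under each scoring option can be compared side by side. Several details are assumptions rather than anything stated in this thread: the "opp" namespace URI is a placeholder, fn:doc() stands in for whatever expression is actually being searched, the result element is purely illustrative, and the prolog uses current XQuery syntax, which may need adjusting for a 3.1 server.

```xquery
declare namespace dc  = "http://purl.org/dc/elements/1.1/";
(: Placeholder URI: the real "opp" namespace is not given in this thread. :)
declare namespace opp = "http://example.com/opp";

for $doc in cts:search(
  (: Assumption: search every document in the database. :)
  fn:doc(),
  cts:or-query((
    (: "bach" inside dc:title, with the weight of 16 from the fragment above :)
    cts:element-query(xs:QName("dc:title"), cts:word-query("bach", (), 16)),
    (: "bach" inside opp:body, with the default weight :)
    cts:element-query(xs:QName("opp:body"), cts:word-query("bach"))
  )),
  (: Swap in "score-logtfidf" or "score-simple" to compare orderings. :)
  "score-logtf"
)
return
  (: One line per hit: document URI plus the score assigned by the search. :)
  <result uri="{xdmp:node-uri($doc)}" score="{cts:score($doc)}"/>
```

Running this once per scoring option and comparing the score attributes for a document with "bach" in dc:title against one without should show whether the weight of 16 is really reaching documents that only match on opp:body.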


--
Peter Hickman.

Semantico, Lees House, 21-23 Dyke Road, Brighton BN1 3FE
t: 01273 722222
f: 01273 723232
e: [EMAIL PROTECTED]
w: www.semantico.com

