Michael Blakeley wrote:
I think the interesting point for your question is that scores are calculated based on inverse document frequency (IDF) as well as term frequency (TF). If that doesn't suit your application, you can choose an alternative scoring technique: try score-logtf, or score-simple, as options to cts:search() - http://developer.marklogic.com/pubs/3.1/apidocs/SearchBuiltins.html#search has more information.


Sorry for the lateness of the reply (stuff cropping up at home, then a bank holiday, etc etc etc). I have tried "score-logtfidf", "score-logtf", and even "score-simple". And although they change the scores they do not seem to change the ordering. The problem as I see it is that a weighting that applies to documents that matched on dc:title seems to be applied to documents that do not match the dc:title. Given,

cts:element-query(xs:QName("dc:title"),cts:word-query("bach",(),16)),
cts:element-query(xs:QName("opp:body"),cts:word-query("bach"))

in a cts:or-query should boost the score of documents that match "bach" in the dc:title element and not boost the score for documents that do not. However the examples show that the score of documents that do not have "bach" in the dc:title element are being boosted along with those that do. This is confusing and makes me feel like I have no idea as to what is going on. My understanding is that results 11 and 12 should not have been boosted, but they were. I need to know why if I am to make use of this facility.

--
Peter Hickman.

Semantico, Lees House, 21-23 Dyke Road, Brighton BN1 3FE
t: 01273 722222
f: 01273 723232
e: [EMAIL PROTECTED]
w: www.semantico.com

_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general

Reply via email to