Michael Blakeley wrote:
> I think the interesting point for your question is that scores are
> calculated based on inverse document frequency (IDF) as well as term
> frequency (TF). If that doesn't suit your application, you can choose
> an alternative scoring technique: try score-logtf, or score-simple, as
> options to cts:search() -
> http://developer.marklogic.com/pubs/3.1/apidocs/SearchBuiltins.html#search
> has more information.
Sorry for the late reply (things cropping up at home, then a bank
holiday, and so on). I have tried "score-logtfidf", "score-logtf", and
even "score-simple". Although they change the scores, they do not seem
to change the ordering.

The problem as I see it is that a weighting meant for documents that
match on dc:title seems to be applied to documents that do not match on
dc:title. The two queries

cts:element-query(xs:QName("dc:title"),cts:word-query("bach",(),16)),
cts:element-query(xs:QName("opp:body"),cts:word-query("bach"))

in a cts:or-query should boost the score of documents that match "bach"
in the dc:title element and not boost the score of documents that do
not. However, the examples show that the scores of documents that do not
have "bach" in the dc:title element are being boosted along with those
that do. This is confusing and leaves me unsure what is actually going
on. My understanding is that results 11 and 12 should not have been
boosted, but they were. I need to know why if I am to make use of this
facility.
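
A minimal sketch of how the query described above might be assembled and
inspected, so that each result shows its score next to whether it really
matches the weighted dc:title query. It makes several assumptions: the
opp namespace URI is a placeholder (the real one is not given here), the
search runs over all documents via fn:doc(), only the first 12 results
are of interest, and the prolog uses XQuery 1.0 style with semicolons,
which may need adjusting for the dialect a 3.1 server expects.

```xquery
(: Sketch only, not the original poster's actual code. :)
(: dc is the standard Dublin Core elements namespace. :)
declare namespace dc  = "http://purl.org/dc/elements/1.1/";
(: ASSUMPTION: placeholder URI, substitute the real opp namespace. :)
declare namespace opp = "http://example.com/opp";

(: "bach" inside dc:title, with the word-query weight set to 16. :)
let $title-query :=
  cts:element-query(xs:QName("dc:title"), cts:word-query("bach", (), 16))

(: "bach" inside opp:body, with the default weight. :)
let $body-query :=
  cts:element-query(xs:QName("opp:body"), cts:word-query("bach"))

(: Swap "score-simple" for "score-logtf" or "score-logtfidf" to compare. :)
for $doc in cts:search(fn:doc(),
                       cts:or-query(($title-query, $body-query)),
                       "score-simple")[1 to 12]
return
  <result uri="{xdmp:node-uri($doc)}"
          score="{cts:score($doc)}"
          title-match="{cts:contains($doc, $title-query)}"/>
```

If a result reports title-match="false" yet its score is as high as the
results reporting "true", that would narrow the question to how the
weight of 16 is being applied, rather than to which documents match.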
--
Peter Hickman.
Semantico, Lees House, 21-23 Dyke Road, Brighton BN1 3FE
t: 01273 722222
f: 01273 723232
e: [EMAIL PROTECTED]
w: www.semantico.com