Re: Same score for different length matches
In addition to what Chris has correctly suggested, I would like to focus on this sentence : " I am decently certain that at one point in time it worked in a way that a higher match length would rank higher" You mean a match in a longer field would rank higher than a match in a shorter field ? is that what you want ( because it is counter intuitive) ? Furthermore I see that some stemming is applied at query time , is that what you want ? - --- Alessandro Benedetti Search Consultant, R&D Software Engineer, Director Sease Ltd. - www.sease.io -- View this message in context: http://lucene.472066.n3.nabble.com/Same-score-for-different-length-matches-tp4343660p4343917.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Same score for different length matches
: we have multiple documents that are matches for the query in question : ("name:hubwagen"). Thing is, some of the documents only contain the : query, while others match 100% in the "name" field: ... : 5.9861565 = (MATCH) weight(name:hubwag in 8093) [DefaultSimilarity], : result of: : 5.9861565 = fieldWeight in 8093, product of: : 1.0 = tf(freq=1.0), with freq of: : 1.0 = termFreq=1.0 : 5.9861565 = idf(docFreq=109, maxDocs=16101) : 1.0 = fieldNorm(doc=8093) ... : 5.9861565 = (MATCH) weight(name:hubwag in 9537) [DefaultSimilarity], : result of: : 5.9861565 = fieldWeight in 9537, product of: : 1.0 = tf(freq=1.0), with freq of: : 1.0 = termFreq=1.0 : 5.9861565 = idf(docFreq=109, maxDocs=16101) : 1.0 = fieldNorm(doc=9537) ... : that a higher match length would rank higher. As far as I can read in : the SolrRelevancyFAQ, the correct term is "lengthNorm". However, I a : missing a preference for the full match. lengthNorm is a Similarity concept that rolls into the "fieldNorm" at index time. According to your score explanations, the fieldNorm is 1.0 for both docs, suggestion that you have norms disabled -- see the omitNorms option on the fieldType for your "name" field. -Hoss http://www.lucidworks.com/
Same score for different length matches
Hey, we have multiple documents that are matches for the query in question ("name:hubwagen"). Thing is, some of the documents only contain the query, while others match 100% in the "name" field: Hochhubwagen 5.9861565 Hubwagen 5.9861565 The debug looks like this (for the first and 5th match): namhubwagnamehubwag name:Hubwagen name:Hubwagen name:hubwag name:hubwag 5.9861565 = (MATCH) weight(name:hubwag in 8093) [DefaultSimilarity], result of: 5.9861565 = fieldWeight in 8093, product of: 1.0 = tf(freq=1.0), with freq of: 1.0 = termFreq=1.0 5.9861565 = idf(docFreq=109, maxDocs=16101) 1.0 = fieldNorm(doc=8093) 5.9861565 = (MATCH) weight(name:hubwag in 9537) [DefaultSimilarity], result of: 5.9861565 = fieldWeight in 9537, product of: 1.0 = tf(freq=1.0), with freq of: 1.0 = termFreq=1.0 5.9861565 = idf(docFreq=109, maxDocs=16101) 1.0 = fieldNorm(doc=9537) Now, I am decently certain that at one point in time it worked in a way that a higher match length would rank higher. As far as I can read in the SolrRelevancyFAQ, the correct term is "lengthNorm". However, I a missing a preference for the full match. Usually, the debug helps me identify mistakes, but in this case, the debug only tells me that the scores are perfectly equal, down to the lowest level.