Hello everyone.
I've run into a weird problem with Solr's ranking engine. In a nutshell, certain results are getting EXTREMELY high scores. Here is an example query:

locRvwText:"Pizza Pizza"^10 OR locName:"Pizza Pizza"^30

The way I understand it, the locName part of the query should be boosted 3x more than the locRvwText part. However, when I run this query the first result is:

10.8226 Johnnie's New York Pizzeria
Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza...

10.8226 = (MATCH) product of:
  21.6452 = (MATCH) sum of:
    21.6452 = weight(locRvwText:"pizza pizza"^10.0 in 3792465), product of:
      0.3354544 = queryWeight(locRvwText:"pizza pizza"^10.0), product of:
        10.0 = boost
        14.428232 = idf(locRvwText: pizza=8156 pizza=8156)
        0.0023249863 = queryNorm
      64.52502 = fieldWeight(locRvwText:"pizza pizza" in 3792465), product of:
        4.472136 = tf(phraseFreq=20.0)
        14.428232 = idf(locRvwText: pizza=8156 pizza=8156)
        1.0 = fieldNorm(field=locRvwText, doc=3792465)
  0.5 = coord(1/2)

How come the phrase frequency for rvwText comes back as 20?

The field rvwText is defined in the following way:

And my text fields are defined in the following way:

Forgive me if I am wrong, but shouldn't the RemoveDuplicatesTokenFilterFactory make the string "Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza... Pizza..." count as simply one Pizza?

I'd appreciate any help I can get!

Thanks!

--
View this message in context: http://www.nabble.com/Question-regarding-Solr-ranking-tp15719752p15719752.html
Sent from the Solr - User mailing list archive at Nabble.com.
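
P.S. For anyone who wants to check the arithmetic in the explain output above, here is a rough sketch that recomputes the score from the leaf values. It assumes Lucene's default TF-IDF similarity, where tf is the square root of the frequency; the variable names are my own, for illustration only, and are not Solr identifiers.

```python
import math

# Leaf values copied from the explain output in this post.
boost = 10.0
idf = 14.428232
query_norm = 0.0023249863
phrase_freq = 20.0
field_norm = 1.0
coord = 0.5  # only 1 of the 2 OR clauses matched

# Assumption: default similarity, tf = sqrt(frequency).
tf = math.sqrt(phrase_freq)

query_weight = boost * idf * query_norm   # explain reports 0.3354544
field_weight = tf * idf * field_norm      # explain reports 64.52502
clause_weight = query_weight * field_weight
score = clause_weight * coord             # explain reports 10.8226

print(f"tf={tf:.6f} queryWeight={query_weight:.7f} "
      f"fieldWeight={field_weight:.5f} score={score:.4f}")
```

The recomputed values line up with the explain output, so the high score seems to come from the large phraseFreq and the idf being applied in both queryWeight and fieldWeight, not from the boost alone.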