I'm still on solr 1.4.0 and the analysis page looks like they should match, and other products with the same content do in fact match. I'm reindexing the non-matching ones to rule that out.
-----Original Message----- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Monday, July 25, 2011 1:58 PM To: solr-user@lucene.apache.org Subject: Re: please help explaining debug output Hmmm, I'm assuming that moreWords is your default text field, yes? But it works for me (tm), using 1.4.1. What version of Solr are you on? Also, take a glance at the admin/analysis page, that might help... Gotta run Erick On Mon, Jul 25, 2011 at 4:52 PM, Robert Petersen <rober...@buy.com> wrote: > Sorry, to clarify a search for P1102W matches all three docs but a > search for p1102w LaserJet only matches the second two. Someone asked > me a question while I was typing and I got distracted, apologies for any > confusion. > > -----Original Message----- > From: Robert Petersen [mailto:rober...@buy.com] > Sent: Monday, July 25, 2011 1:42 PM > To: solr-user@lucene.apache.org > Subject: please help explaining debug output > > I have three documents with the following product titles in a text field > called moreWords with analysis stack matching the solr example text > field definition. > > > > 1. HP LaserJet P1102W Monochrome Laser Printer > <http://www.buy.com/prod/hp-laserjet-p1102w-monochrome-laser-printer/q/l > oc/101/213824965.html> > > 2. HP CE285A (85A) Remanufactured Black Toner Cartridge for > LaserJet M1212nf, P1102, P1102W Series > <http://www.buy.com/prod/hp-ce285a-85a-remanufactured-black-toner-cartri > dge-for-laserjet/q/loc/101/217145536.html> > > 3. Black HP CE285A Toner Cartridge For LaserJet P1102W, LaserJet > M1130, LaserJet M1132, LaserJet M1210 > <http://www.buy.com/prod/black-hp-ce285a-toner-cartridge-for-laserjet-p1 > 102w-laserjet-m1130/q/loc/101/222045267.html> > > > > A search for P1102W matches (2) and (3), but not (1) above. Can someone > explain the debug output? It looks like I am getting a non-match on (1) > because term frequency is zero? Am I reading that right? If so, how > could that be? the searched terms are equivalently in all three docs. I > don't get it. > > > > > > <lst name="debug"> > > <str name="rawquerystring">p1102w LaserJet </str> > > <str name="querystring">p1102w LaserJet </str> > > <str name="parsedquery">+PhraseQuery(moreWords:"p 1102 w") > +PhraseQuery(moreWords:"laser jet")</str> > > <str name="parsedquery_toString">+moreWords:"p 1102 w" +moreWords:"laser > jet"</str> > > <lst name="explain"> > > <str name="222045267"> > > 3.64852 = (MATCH) sum of: > > 2.4758534 = weight(moreWords:"p 1102 w" in 6667236), product of: > > 0.7955347 = queryWeight(moreWords:"p 1102 w"), product of: > > 19.166107 = idf(moreWords: p=189166 1102=1135 w=445720) > > 0.041507367 = queryNorm > > 3.1121879 = fieldWeight(moreWords:"p 1102 w" in 6667236), product > of: > > 1.7320508 = tf(phraseFreq=3.0) > > 19.166107 = idf(moreWords: p=189166 1102=1135 w=445720) > > 0.09375 = fieldNorm(field=moreWords, doc=6667236) > > 1.1726664 = weight(moreWords:"laser jet" in 6667236), product of: > > 0.60590804 = queryWeight(moreWords:"laser jet"), product of: > > 14.597603 = idf(moreWords: laser=26731 jet=12685) > > 0.041507367 = queryNorm > > 1.9353869 = fieldWeight(moreWords:"laser jet" in 6667236), product > of: > > 1.4142135 = tf(phraseFreq=2.0) > > 14.597603 = idf(moreWords: laser=26731 jet=12685) > > 0.09375 = fieldNorm(field=moreWords, doc=6667236) > > > > </str> > > <str name="222045265"> > > 2.8656518 = (MATCH) sum of: > > 1.4294347 = weight(moreWords:"p 1102 w" in 6684158), product of: > > 0.7955347 = queryWeight(moreWords:"p 1102 w"), product of: > > 19.166107 = idf(moreWords: p=189166 1102=1135 w=445720) > > 0.041507367 = queryNorm > > 1.7968225 = fieldWeight(moreWords:"p 1102 w" in 6684158), product > of: > > 1.0 = tf(phraseFreq=1.0) > > 19.166107 = idf(moreWords: p=189166 1102=1135 w=445720) > > 0.09375 = fieldNorm(field=moreWords, doc=6684158) > > 1.4362172 = weight(moreWords:"laser jet" in 6684158), product of: > > 0.60590804 = queryWeight(moreWords:"laser jet"), product of: > > 14.597603 = idf(moreWords: laser=26731 jet=12685) > > 0.041507367 = queryNorm > > 2.3703551 = fieldWeight(moreWords:"laser jet" in 6684158), product > of: > > 1.7320508 = tf(phraseFreq=3.0) > > 14.597603 = idf(moreWords: laser=26731 jet=12685) > > 0.09375 = fieldNorm(field=moreWords, doc=6684158) > > > > </str> > > </lst> > > <str name="otherQuery">sku:213824965 > > </str> > > <lst name="explainOther"> > > <str name="213824965"> > > 0.0 = (NON-MATCH) Failure to meet condition(s) of required/prohibited > clause(s) > > 1.1911955 = weight(moreWords:"p 1102 w" in 32497), product of: > > 0.7955347 = queryWeight(moreWords:"p 1102 w"), product of: > > 19.166107 = idf(moreWords: p=189166 1102=1135 w=445720) > > 0.041507367 = queryNorm > > 1.4973521 = fieldWeight(moreWords:"p 1102 w" in 32497), product of: > > 1.0 = tf(phraseFreq=1.0) > > 19.166107 = idf(moreWords: p=189166 1102=1135 w=445720) > > 0.078125 = fieldNorm(field=moreWords, doc=32497) > > 0.0 = no match on required clause (moreWords:"laser jet") > > 0.0 = weight(moreWords:"laser jet" in 32497), product of: > > 0.60590804 = queryWeight(moreWords:"laser jet"), product of: > > 14.597603 = idf(moreWords: laser=26731 jet=12685) > > 0.041507367 = queryNorm > > 0.0 = fieldWeight(moreWords:"laser jet" in 32497), product of: > > 0.0 = tf(phraseFreq=0.0) > > 14.597603 = idf(moreWords: laser=26731 jet=12685) > > 0.078125 = fieldNorm(field=moreWords, doc=32497) > > > > </str> > > </lst> > >