Thank you very much! On Wed, Mar 9, 2011 at 2:01 AM, Yonik Seeley <yo...@lucidimagination.com>wrote:
> It's probably the WordDelimiterFilter: > > > org.apache.solr.analysis.WordDelimiterFilterFactory > args:{preserveOriginal: > > 1 splitOnCaseChange: 1 generateNumberParts: 1 catenateWords: 0 > > generateWordParts: 1 catenateAll: 0 catenateNumbers: 0 } > > Get rid of the preserveOriginal="1" in the query analyzer. > > -Yonik > http://lucidimagination.com > > On Tue, Mar 1, 2011 at 9:01 AM, Glòria Martínez > <gloria.marti...@careesma.com> wrote: > > Hello, > > > > I can't understand why this query is not matching anything. Could someone > > help me please? > > > > *Query* > > > http://localhost:8894/solr/select?q=linguajob.pl&qf=company_name&wt=xml&qt=dismax&debugQuery=on&explainOther=id%3A1 > > > > <response> > > - > > <lst name="responseHeader"> > > <int name="status">0</int> > > <int name="QTime">12</int> > > - > > <lst name="params"> > > <str name="explainOther">id:1</str> > > <str name="debugQuery">on</str> > > <str name="q">linguajob.pl</str> > > <str name="qf">company_name</str> > > <str name="wt">xml</str> > > <str name="qt">dismax</str> > > </lst> > > </lst> > > <result name="response" numFound="0" start="0"/> > > - > > <lst name="debug"> > > <str name="rawquerystring">linguajob.pl</str> > > <str name="querystring">linguajob.pl</str> > > - > > <str name="parsedquery"> > > +DisjunctionMaxQuery((company_name:"(linguajob.pl linguajob) pl")~0.01) > () > > </str> > > - > > <str name="parsedquery_toString"> > > +(company_name:"(linguajob.pl linguajob) pl")~0.01 () > > </str> > > <lst name="explain"/> > > <str name="otherQuery">id:1</str> > > - > > <lst name="explainOther"> > > - > > <str name="1"> > > > > 0.0 = (NON-MATCH) Failure to meet condition(s) of required/prohibited > > clause(s) > > 0.0 = no match on required clause (company_name:"(linguajob.pllinguajob) > > pl") *<- What does this syntax (field:"(token1 token2) token3") mean?* > > 0.0 = (NON-MATCH) fieldWeight(company_name:"(linguajob.pl linguajob) > pl" > > in 0), product of: > > 0.0 = tf(phraseFreq=0.0) > > 1.6137056 = idf(company_name:"(linguajob.pl linguajob) pl") > > 0.4375 = fieldNorm(field=company_name, doc=0) > > </str> > > </lst> > > <str name="QParser">DisMaxQParser</str> > > <null name="altquerystring"/> > > <null name="boostfuncs"/> > > + > > <lst name="timing"> > > ... > > </response> > > > > > > > > There's only one document indexed: > > > > *Document* > > http://localhost:8894/solr/select?q=1&qf=id&wt=xml&qt=dismax > > <response> > > - > > <lst name="responseHeader"> > > <int name="status">0</int> > > <int name="QTime">2</int> > > - > > <lst name="params"> > > <str name="qf">id</str> > > <str name="wt">xml</str> > > <str name="qt">dismax</str> > > <str name="q">1</str> > > </lst> > > </lst> > > - > > <result name="response" numFound="1" start="0"> > > - > > <doc> > > <str name="company_name">LinguaJob.pl</str> > > <str name="id">1</str> > > <int name="status">6</int> > > <date name="timestamp">2011-03-01T11:14:24.553Z</date> > > </doc> > > </result> > > </response> > > > > *Solr Admin Schema* > > Field: company_name > > Field Type: text > > Properties: Indexed, Tokenized, Stored > > Schema: Indexed, Tokenized, Stored > > Index: Indexed, Tokenized, Stored > > > > Position Increment Gap: 100 > > > > Index Analyzer: org.apache.solr.analysis.TokenizerChain Details > > Tokenizer Class: org.apache.solr.analysis.WhitespaceTokenizerFactory > > Filters: > > schema.UnicodeNormalizationFilterFactory args:{composed: false > > remove_modifiers: true fold: true version: java6 remove_diacritics: true > } > > org.apache.solr.analysis.StopFilterFactory args:{words: stopwords.txt > > ignoreCase: true enablePositionIncrements: true } > > org.apache.solr.analysis.WordDelimiterFilterFactory > args:{preserveOriginal: > > 1 splitOnCaseChange: 1 generateNumberParts: 1 catenateWords: 1 > > generateWordParts: 1 catenateAll: 0 catenateNumbers: 1 } > > org.apache.solr.analysis.LowerCaseFilterFactory args:{} > > org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory args:{} > > > > Query Analyzer: org.apache.solr.analysis.TokenizerChain Details > > Tokenizer Class: org.apache.solr.analysis.WhitespaceTokenizerFactory > > Filters: > > schema.UnicodeNormalizationFilterFactory args:{composed: false > > remove_modifiers: true fold: true version: java6 remove_diacritics: true > } > > org.apache.solr.analysis.SynonymFilterFactory args:{synonyms: > synonyms.txt > > expand: true ignoreCase: true } > > org.apache.solr.analysis.StopFilterFactory args:{words: stopwords.txt > > ignoreCase: true } > > org.apache.solr.analysis.WordDelimiterFilterFactory > args:{preserveOriginal: > > 1 splitOnCaseChange: 1 generateNumberParts: 1 catenateWords: 0 > > generateWordParts: 1 catenateAll: 0 catenateNumbers: 0 } > > org.apache.solr.analysis.LowerCaseFilterFactory args:{} > > org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory args:{} > > > > Docs: 1 > > Distinct: 5 > > Top 5 terms > > term frequency > > lingua 1 > > linguajob.pl 1 > > linguajobpl 1 > > pl 1 > > job 1 > > > > *Solr Analysis* > > Field name: company_name > > Field value (Index): LinguaJob.pl > > Field value (Query): linguajob.pl > > > > *Index Analyzer > > > > org.apache.solr.analysis.WhitespaceTokenizerFactory {} > > term position 1 > > term text LinguaJob.pl > > term type word > > source start,end 0,12 > > payload > > > > schema.UnicodeNormalizationFilterFactory {composed=false, > > remove_modifiers=true, fold=true, version=java6, remove_diacritics=true} > > term position 1 > > term text LinguaJob.pl > > term type word > > source start,end 0,12 > > payload > > > > org.apache.solr.analysis.StopFilterFactory {words=stopwords.txt, > > ignoreCase=true, enablePositionIncrements=true} > > term position 1 > > term text LinguaJob.pl > > term type word > > source start,end 0,12 > > payload > > > > org.apache.solr.analysis.WordDelimiterFilterFactory {preserveOriginal=1, > > splitOnCaseChange=1, generateNumberParts=1, catenateWords=1, > > generateWordParts=1, catenateAll=0, catenateNumbers=1} > > term position 123 > > term text LinguaJob.plJobpl > > LinguaLinguaJobpl > > term type wordwordword > > wordword > > source start,end 0,126,910,12 > > 0,60,12 > > payload > > > > org.apache.solr.analysis.LowerCaseFilterFactory {} > > term position 123 > > term text linguajob.pljobpl > > lingualinguajobpl > > term type wordwordword > > wordword > > source start,end 0,126,910,12 > > 0,60,12 > > payload > > > > org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory {} > > term position 123 > > term text linguajob.pljobpl > > lingualinguajobpl > > term type wordwordword > > wordword > > source start,end 0,126,910,12 > > 0,60,12 > > payload > > > > *Query Analyzer > > > > org.apache.solr.analysis.WhitespaceTokenizerFactory {} > > term position 1 > > term text linguajob.pl > > term type word > > source start,end 0,12 > > payload > > > > schema.UnicodeNormalizationFilterFactory {composed=false, > > remove_modifiers=true, fold=true, version=java6, remove_diacritics=true} > > term position 1 > > term text linguajob.pl > > term type word > > source start,end 0,12 > > payload > > > > org.apache.solr.analysis.SynonymFilterFactory {synonyms=synonyms.txt, > > expand=true, ignoreCase=true} > > term position 1 > > term text linguajob.pl > > term type word > > source start,end 0,12 > > payload > > > > org.apache.solr.analysis.StopFilterFactory {words=stopwords.txt, > > ignoreCase=true} > > term position 1 > > term text linguajob.pl > > term type word > > source start,end 0,12 > > payload > > > > org.apache.solr.analysis.WordDelimiterFilterFactory {preserveOriginal=1, > > splitOnCaseChange=1, generateNumberParts=1, catenateWords=0, > > generateWordParts=1, catenateAll=0, catenateNumbers=0} > > term position 12 > > term text linguajob.plpl > > linguajob > > term type wordword > > word > > source start,end 0,1210,12 > > 0,9 > > payload > > > > org.apache.solr.analysis.LowerCaseFilterFactory {} > > term position 12 > > term text linguajob.plpl > > linguajob > > term type wordword > > word > > source start,end 0,1210,12 > > 0,9 > > payload > > > > org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory {} > > term position 12 > > term text linguajob.plpl > > linguajob > > term type wordword > > word > > source start,end 0,1210,12 > > 0,9 > > payload > > > > Thank you very much! > > Glòria Martinez. > > > -- -- Glòria Martínez Hidalgo Software Developer InfoJobs International Passeig de Gracia, 11A, 6,1 08007 Barcelona (Spain)