It's probably the WordDelimiterFilter:

> org.apache.solr.analysis.WordDelimiterFilterFactory args:{preserveOriginal:
> 1 splitOnCaseChange: 1 generateNumberParts: 1 catenateWords: 0
> generateWordParts: 1 catenateAll: 0 catenateNumbers: 0 }

Get rid of the preserveOriginal="1" in the query analyzer.

-Yonik
http://lucidimagination.com

On Tue, Mar 1, 2011 at 9:01 AM, Glòria Martínez
<gloria.marti...@careesma.com> wrote:
> Hello,
>
> I can't understand why this query is not matching anything. Could someone
> help me please?
>
> *Query*
> http://localhost:8894/solr/select?q=linguajob.pl&qf=company_name&wt=xml&qt=dismax&debugQuery=on&explainOther=id%3A1
>
> <response>
> -
> <lst name="responseHeader">
> <int name="status">0</int>
> <int name="QTime">12</int>
> -
> <lst name="params">
> <str name="explainOther">id:1</str>
> <str name="debugQuery">on</str>
> <str name="q">linguajob.pl</str>
> <str name="qf">company_name</str>
> <str name="wt">xml</str>
> <str name="qt">dismax</str>
> </lst>
> </lst>
> <result name="response" numFound="0" start="0"/>
> -
> <lst name="debug">
> <str name="rawquerystring">linguajob.pl</str>
> <str name="querystring">linguajob.pl</str>
> -
> <str name="parsedquery">
> +DisjunctionMaxQuery((company_name:"(linguajob.pl linguajob) pl")~0.01) ()
> </str>
> -
> <str name="parsedquery_toString">
> +(company_name:"(linguajob.pl linguajob) pl")~0.01 ()
> </str>
> <lst name="explain"/>
> <str name="otherQuery">id:1</str>
> -
> <lst name="explainOther">
> -
> <str name="1">
>
> 0.0 = (NON-MATCH) Failure to meet condition(s) of required/prohibited
> clause(s)
>  0.0 = no match on required clause (company_name:"(linguajob.pl linguajob)
> pl") *<- What does this syntax (field:"(token1 token2) token3") mean?*
>    0.0 = (NON-MATCH) fieldWeight(company_name:"(linguajob.pl linguajob) pl"
> in 0), product of:
>      0.0 = tf(phraseFreq=0.0)
>      1.6137056 = idf(company_name:"(linguajob.pl linguajob) pl")
>      0.4375 = fieldNorm(field=company_name, doc=0)
> </str>
> </lst>
> <str name="QParser">DisMaxQParser</str>
> <null name="altquerystring"/>
> <null name="boostfuncs"/>
> +
> <lst name="timing">
> ...
> </response>
>
>
>
> There's only one document indexed:
>
> *Document*
> http://localhost:8894/solr/select?q=1&qf=id&wt=xml&qt=dismax
> <response>
> -
> <lst name="responseHeader">
> <int name="status">0</int>
> <int name="QTime">2</int>
> -
> <lst name="params">
> <str name="qf">id</str>
> <str name="wt">xml</str>
> <str name="qt">dismax</str>
> <str name="q">1</str>
> </lst>
> </lst>
> -
> <result name="response" numFound="1" start="0">
> -
> <doc>
> <str name="company_name">LinguaJob.pl</str>
> <str name="id">1</str>
> <int name="status">6</int>
> <date name="timestamp">2011-03-01T11:14:24.553Z</date>
> </doc>
> </result>
> </response>
>
> *Solr Admin Schema*
> Field: company_name
> Field Type: text
> Properties: Indexed, Tokenized, Stored
> Schema: Indexed, Tokenized, Stored
> Index: Indexed, Tokenized, Stored
>
> Position Increment Gap: 100
>
> Index Analyzer: org.apache.solr.analysis.TokenizerChain Details
> Tokenizer Class: org.apache.solr.analysis.WhitespaceTokenizerFactory
> Filters:
> schema.UnicodeNormalizationFilterFactory args:{composed: false
> remove_modifiers: true fold: true version: java6 remove_diacritics: true }
> org.apache.solr.analysis.StopFilterFactory args:{words: stopwords.txt
> ignoreCase: true enablePositionIncrements: true }
> org.apache.solr.analysis.WordDelimiterFilterFactory args:{preserveOriginal:
> 1 splitOnCaseChange: 1 generateNumberParts: 1 catenateWords: 1
> generateWordParts: 1 catenateAll: 0 catenateNumbers: 1 }
> org.apache.solr.analysis.LowerCaseFilterFactory args:{}
> org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory args:{}
>
> Query Analyzer: org.apache.solr.analysis.TokenizerChain Details
> Tokenizer Class: org.apache.solr.analysis.WhitespaceTokenizerFactory
> Filters:
> schema.UnicodeNormalizationFilterFactory args:{composed: false
> remove_modifiers: true fold: true version: java6 remove_diacritics: true }
> org.apache.solr.analysis.SynonymFilterFactory args:{synonyms: synonyms.txt
> expand: true ignoreCase: true }
> org.apache.solr.analysis.StopFilterFactory args:{words: stopwords.txt
> ignoreCase: true }
> org.apache.solr.analysis.WordDelimiterFilterFactory args:{preserveOriginal:
> 1 splitOnCaseChange: 1 generateNumberParts: 1 catenateWords: 0
> generateWordParts: 1 catenateAll: 0 catenateNumbers: 0 }
> org.apache.solr.analysis.LowerCaseFilterFactory args:{}
> org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory args:{}
>
> Docs: 1
> Distinct: 5
> Top 5 terms
> term frequency
> lingua 1
> linguajob.pl 1
> linguajobpl 1
> pl 1
> job 1
>
> *Solr Analysis*
> Field name: company_name
> Field value (Index): LinguaJob.pl
> Field value (Query): linguajob.pl
>
> *Index Analyzer
>
> org.apache.solr.analysis.WhitespaceTokenizerFactory {}
> term position 1
> term text LinguaJob.pl
> term type word
> source start,end 0,12
> payload
>
> schema.UnicodeNormalizationFilterFactory {composed=false,
> remove_modifiers=true, fold=true, version=java6, remove_diacritics=true}
> term position 1
> term text LinguaJob.pl
> term type word
> source start,end 0,12
> payload
>
> org.apache.solr.analysis.StopFilterFactory {words=stopwords.txt,
> ignoreCase=true, enablePositionIncrements=true}
> term position 1
> term text LinguaJob.pl
> term type word
> source start,end 0,12
> payload
>
> org.apache.solr.analysis.WordDelimiterFilterFactory {preserveOriginal=1,
> splitOnCaseChange=1, generateNumberParts=1, catenateWords=1,
> generateWordParts=1, catenateAll=0, catenateNumbers=1}
> term position 123
> term text LinguaJob.plJobpl
> LinguaLinguaJobpl
> term type wordwordword
> wordword
> source start,end 0,126,910,12
> 0,60,12
> payload
>
> org.apache.solr.analysis.LowerCaseFilterFactory {}
> term position 123
> term text linguajob.pljobpl
> lingualinguajobpl
> term type wordwordword
> wordword
> source start,end 0,126,910,12
> 0,60,12
> payload
>
> org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory {}
> term position 123
> term text linguajob.pljobpl
> lingualinguajobpl
> term type wordwordword
> wordword
> source start,end 0,126,910,12
> 0,60,12
> payload
>
> *Query Analyzer
>
> org.apache.solr.analysis.WhitespaceTokenizerFactory {}
> term position 1
> term text linguajob.pl
> term type word
> source start,end 0,12
> payload
>
> schema.UnicodeNormalizationFilterFactory {composed=false,
> remove_modifiers=true, fold=true, version=java6, remove_diacritics=true}
> term position 1
> term text linguajob.pl
> term type word
> source start,end 0,12
> payload
>
> org.apache.solr.analysis.SynonymFilterFactory {synonyms=synonyms.txt,
> expand=true, ignoreCase=true}
> term position 1
> term text linguajob.pl
> term type word
> source start,end 0,12
> payload
>
> org.apache.solr.analysis.StopFilterFactory {words=stopwords.txt,
> ignoreCase=true}
> term position 1
> term text linguajob.pl
> term type word
> source start,end 0,12
> payload
>
> org.apache.solr.analysis.WordDelimiterFilterFactory {preserveOriginal=1,
> splitOnCaseChange=1, generateNumberParts=1, catenateWords=0,
> generateWordParts=1, catenateAll=0, catenateNumbers=0}
> term position 12
> term text linguajob.plpl
> linguajob
> term type wordword
> word
> source start,end 0,1210,12
> 0,9
> payload
>
> org.apache.solr.analysis.LowerCaseFilterFactory {}
> term position 12
> term text linguajob.plpl
> linguajob
> term type wordword
> word
> source start,end 0,1210,12
> 0,9
> payload
>
> org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory {}
> term position 12
> term text linguajob.plpl
> linguajob
> term type wordword
> word
> source start,end 0,1210,12
> 0,9
> payload
>
> Thank you very much!
> Glòria Martinez.
>

Reply via email to