Rupert Westenthaler created STANBOL-607:
-------------------------------------------

             Summary: SolrYard should use quotes instead of AND for multi word 
TextConstraints
                 Key: STANBOL-607
                 URL: https://issues.apache.org/jira/browse/STANBOL-607
             Project: Stanbol
          Issue Type: Improvement
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler
            Priority: Minor
             Fix For: 0.10.0-incubating


Currently a Text constraint for rdfs:label containing "The Book of Three" is 
encoded like

    (_\!@/rdfs\:label/:The) AND ((_\!@/rdfs\:label/:Book) AND  
(_\!@/rdfs\:label/:of) AND(_\!@/rdfs\:label/:Three))

however Solr/Lucene allow to use quotes for multi word searches. So the correct 
way to encode this query would be

    ((_\!@/rdfs\:label/:"The Book of Three"))

This need to be fixed in the 
"org.apache.stanbol.entityhub.yard.solr.query.QueryUtils#encodeQueryValue(..)

NOTE: The impact of this change for Wildcard queries need to be further 
investigated

e.g  take a query "Frankf* am Main" currently encoded like

    (_\!@/rdfs\:label/:Frankf*) AND ((_\!@/rdfs\:label/:am) AND  
(_\!@/rdfs\:label/:main))

would than result in

    ((_\!@/rdfs\:label/:"Frankf* am Main"))



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to