edismax pf clause makes no sense when query already has quoted subphrase
------------------------------------------------------------------------

                 Key: SOLR-3008
                 URL: https://issues.apache.org/jira/browse/SOLR-3008
             Project: Solr
          Issue Type: Bug
            Reporter: Hoss Man

As noted by ldavid2020 on the solr-user mailing list (Tue, 20 Dec 2011), the behavior of edismax when the pf param is used and the query string contains quotes around part of the query makes no sense at all...

{quote}
For the same query:

2012 "japan airlines" flight status

dismax...
[http://localhost:8983/solr/select?q=2012+"japan+airlines"+flight+status&qf=TTL&pf=TTL&debugQuery=true&defType=dismax]
outputs:

{noformat}
+((DisjunctionMaxQuery((TTL:2012)~0.1)
   DisjunctionMaxQuery((TTL:"japan airlin"~3)~0.1)
   DisjunctionMaxQuery((TTL:flight)~0.1)
   DisjunctionMaxQuery((TTL:status)~0.1)
  )~3)
DisjunctionMaxQuery((TTL:"2012 japan airlin flight status"~3)~0.1)
{noformat}

The parsedquery has DisjunctionMaxQuery((TTL:"2012 japan airlin flight status"~3)~0.1).

While edismax...
[http://localhost:8983/solr/select?q=2012+"japan+airlines"+flight+status&qf=TTL&pf=TTL&debugQuery=true&defType=edismax]
outputs:

{noformat}
+((DisjunctionMaxQuery((TTL:2012)~0.1)
   DisjunctionMaxQuery((TTL:"japan airlin"~3)~0.1)
   DisjunctionMaxQuery((TTL:flight)~0.1)
   DisjunctionMaxQuery((TTL:status)~0.1)
  )~3)
DisjunctionMaxQuery((TTL:"2012 flight status"~3)~0.1)
{noformat}

The parsedquery has DisjunctionMaxQuery((TTL:"2012 flight status"~3)~0.1).

...

So it seems edismax ignores "japan airlines" when building the pf phrase. This could cause problems: a document containing the exact phrase "2012 japan airlines flight status" will get the same relevancy score as a document in which the two phrases "japan airlines" and "2012 flight status" appear far apart.
{quote}
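
To make the difference between the two debug outputs concrete, here is a minimal toy model in Python. It is not Solr's implementation: the function names (split_clauses, dismax_pf_phrase, edismax_pf_phrase) are hypothetical, and the two rules are inferred only from the parsedquery output quoted above. No analysis chain is applied, so "airlines" is not stemmed to "airlin" as it is in the real output.

```python
import re


def split_clauses(q):
    """Split a query string into (text, is_quoted) clauses."""
    clauses = []
    for m in re.finditer(r'"([^"]*)"|(\S+)', q):
        if m.group(1) is not None:
            clauses.append((m.group(1), True))   # quoted subphrase
        else:
            clauses.append((m.group(2), False))  # bare term
    return clauses


def dismax_pf_phrase(q):
    # Assumption from the dismax output above: the quotes are dropped and
    # every term in the query ends up in the pf phrase.
    return " ".join(text for text, _ in split_clauses(q))


def edismax_pf_phrase(q):
    # Assumption from the edismax output above: quoted clauses are skipped,
    # so only the bare terms end up in the pf phrase.
    return " ".join(text for text, quoted in split_clauses(q) if not quoted)


query = '2012 "japan airlines" flight status'
print(dismax_pf_phrase(query))   # 2012 japan airlines flight status
print(edismax_pf_phrase(query))  # 2012 flight status
```

Under this model the edismax phrase loses the two terms that the user explicitly marked as a phrase, which matches the concern raised in the quoted report.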