[ 
https://issues.apache.org/jira/browse/SOLR-6062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14012039#comment-14012039
 ] 

Ron Mayer commented on SOLR-6062:
---------------------------------

Regarding "the original/linked issue allowed the same field to be passed 
through a pf parameter with differing slop values. The intent being that those 
scores would be combined, rather than the max being used across those fields":
The observation that led to using the same field with different slop values 
was that a document was likely to be interesting if either: many of the words 
in the searched clauses were in the same paragraph (a pretty large slop 
value); or many pairs of words from the search clauses were in the same 
adjective/noun clauses of the text (a quite small slop value, so that a search 
for 'old hairy cat' ranks well against 'hairy old cat').

If I understand right, it sounds to me like what Michael described continues 
to be good for those cases. I'm traveling this week, but I have some test 
cases comparing the ranking of SOLR-2058 vs. human-sorted documents that I can 
run when I'm back Thursday of next week.



> Phrase queries are created for each field supplied through edismax's pf, pf2 
> and pf3 parameters (rather than them being combined in a single dismax query)
> -----------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-6062
>                 URL: https://issues.apache.org/jira/browse/SOLR-6062
>             Project: Solr
>          Issue Type: Bug
>          Components: query parsers
>    Affects Versions: 4.0
>            Reporter: Michael Dodsworth
>            Priority: Minor
>         Attachments: combined-phrased-dismax.patch
>
>
> https://issues.apache.org/jira/browse/SOLR-2058 subtly changed how phrase 
> queries, created through the pf, pf2 and pf3 parameters, are merged into the 
> main user query.
> For the query: 'term1 term2' with pf2:[field1, field2, field3] we now get 
> (omitting the non phrase query section for clarity):
> {code:java}
> <main query>
> DisjunctionMaxQuery((field1:"term1 term2"^1.0)~0.1)
> DisjunctionMaxQuery((field2:"term1 term2"^1.0)~0.1)
> DisjunctionMaxQuery((field3:"term1 term2"^1.0)~0.1)
> {code}
> Prior to this change, we had:
> {code:java}
> <main query> 
> DisjunctionMaxQuery((field1:"term1 term2"^1.0 | field2:"term1 term2"^1.0 | field3:"term1 term2"^1.0)~0.1)
> {code}
> The upshot is that if the phrase query "term1 term2" appears in multiple 
> fields, it now gets a significant boost compared with the previous 
> implementation.
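
To make the scoring difference described in the issue concrete, here is a 
minimal sketch in plain Java (no Lucene dependency). It assumes the usual 
DisjunctionMaxQuery combination (best sub-score plus the tie-breaker times the 
remaining sub-scores) and that separate optional clauses simply add; coord and 
query normalization are ignored. The class name, method names and per-field 
scores are hypothetical, chosen only for illustration.

```java
import java.util.Arrays;

public class PhraseBoostSketch {

    // DisjunctionMaxQuery-style combination (assumed model):
    // best sub-score + tie * (sum of the other sub-scores).
    static double disMax(double tie, double... scores) {
        if (scores.length == 0) {
            return 0.0;
        }
        double max = Arrays.stream(scores).max().getAsDouble();
        double sum = Arrays.stream(scores).sum();
        return max + tie * (sum - max);
    }

    // Pre-SOLR-2058 shape: one dismax query over all pf2 fields.
    static double combined(double tie, double... perFieldScores) {
        return disMax(tie, perFieldScores);
    }

    // Post-SOLR-2058 shape: one single-field dismax query per field,
    // each added as its own optional clause, so the scores add up.
    static double separate(double tie, double... perFieldScores) {
        double total = 0.0;
        for (double score : perFieldScores) {
            total += disMax(tie, score);
        }
        return total;
    }

    public static void main(String[] args) {
        double tie = 0.1;
        // Hypothetical phrase-match scores for field1, field2, field3.
        double[] scores = {2.0, 1.5, 1.0};
        System.out.println("combined: " + combined(tie, scores));
        System.out.println("separate: " + separate(tie, scores));
    }
}
```

With these made-up scores and a tie of 0.1, the combined form scores about 
2.25 while the per-field form scores about 4.5, which is the extra boost the 
issue describes for documents matching the phrase in several fields.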



--
This message was sent by Atlassian JIRA
(v6.2#6252)
