[jira] [Commented] (SOLR-2087) Dismax handler not handling +/- correctly
[ https://issues.apache.org/jira/browse/SOLR-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15600840#comment-15600840 ]

Cao Manh Dat commented on SOLR-2087:

From the cwiki page https://cwiki.apache.org/confluence/display/solr/The+DisMax+Query+Parser:
{quote}
The q parameter defines the main "query" constituting the essence of the search. The parameter supports raw input strings provided by users with no special escaping. The + and - characters are treated as "mandatory" and "prohibited" modifiers for terms. Text wrapped in balanced quote characters (for example, "San Jose") is treated as a phrase. Any query containing an odd number of quote characters is evaluated as if there were no quote characters at all.
{quote}
When I try to reproduce this issue, here is the result:
{code}
{
  "rawquerystring": "i'm a +walking +contradiction",
  "parsedquery_toString": "+(((_text_:i'm)^1.0) ((_text_:a)^1.0) +((_text_:walking)^1.0) +((_text_:contradiction)^1.0)) (content_txt_en:\"i'm ? walk contradict\")",
}
{code}
So I think we can close this issue as "fixed".

> Dismax handler not handling +/- correctly
> -----------------------------------------
>
>                 Key: SOLR-2087
>                 URL: https://issues.apache.org/jira/browse/SOLR-2087
>             Project: Solr
>          Issue Type: Bug
>          Components: query parsers
>    Affects Versions: 1.4
>            Reporter: Gabriel Weinberg
>
> If I do a query like: i'm a walking contradiction it matches pf as
> text:"i'm_a a_walking walking contradiction"^2.0, and it matches fine.
> If I do a query like: i'm a +walking contradiction it matches pf as
> text:"i'm_a a_+walking +walking contradiction"^2.0 and doesn't match at all.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)

To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
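As an illustration only, the q-parameter rules quoted from the cwiki page can be sketched in a few lines of Python. This is not the DisMax parser's code: the function name `classify_clauses` and the regular expression are assumptions made for the example, and it models only the +/- modifiers and the quote-balancing rule, nothing else.

```python
import re

def classify_clauses(q):
    """Hypothetical helper: label each clause of a raw q string.

    Models only the documented rules: +/- mark a clause as
    mandatory/prohibited, balanced quotes form a phrase, and an odd
    number of quote characters means quotes are ignored entirely.
    """
    # Odd number of quote characters: evaluate as if there were none.
    if q.count('"') % 2 == 1:
        q = q.replace('"', '')
    clauses = []
    # Optional +/- modifier, then either a quoted phrase or a bare term.
    for mod, phrase, term in re.findall(r'([+-]?)(?:"([^"]*)"|(\S+))', q):
        kind = {"+": "mandatory", "-": "prohibited"}.get(mod, "optional")
        clauses.append((kind, phrase if phrase else term))
    return clauses

for clause in classify_clauses("i'm a +walking +contradiction"):
    print(clause)
```

For the query in the debug output above, the sketch labels `walking` and `contradiction` as mandatory and the other two terms as optional, which is consistent with the `+` markers in `parsedquery_toString`.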
[jira] [Commented] (SOLR-2087) Dismax handler not handling +/- correctly
[ https://issues.apache.org/jira/browse/SOLR-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15593917#comment-15593917 ]

Cao Manh Dat commented on SOLR-2087:

Erick Erickson: That makes sense :)
[jira] [Commented] (SOLR-2087) Dismax handler not handling +/- correctly
[ https://issues.apache.org/jira/browse/SOLR-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589582#comment-15589582 ]

Erick Erickson commented on SOLR-2087:

Dat:

I went through in 2013 and started closing old issues based on age, on the theory that if they hadn't been touched in a long time then they might as well be closed. Turns out that was an unpopular idea, so I re-opened a bunch soon after closing.

The JIRA may need to be closed on its own merits, but closing based on age and inactivity isn't good enough.

Erick
[jira] [Commented] (SOLR-2087) Dismax handler not handling +/- correctly
[ https://issues.apache.org/jira/browse/SOLR-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1552#comment-1552 ]

Cao Manh Dat commented on SOLR-2087:

[~hossman] [~erickerickson]: Do you think this issue was reopened accidentally?
[jira] Commented: (SOLR-2087) Dismax handler not handling +/- correctly
[ https://issues.apache.org/jira/browse/SOLR-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902119#action_12902119 ]

Hoss Man commented on SOLR-2087:

It's possible some improvements could be made here, but this is functioning as originally designed.

The pf param treats the entire q param as one big quoted phrase, without any special consideration of other markup, in order to give a boost to things that *exactly* match the user's input in any of the pf fields (i.e. users cut/paste an exact line of text from the content they are looking for).

In your use case, it might make sense to use an alternate tokenizer in the fields you want to use in your pf (perhaps something that tokenizes away the leading +/- characters), but there are other use cases where people have content that includes those characters, and they want the phrase boosting part of the query to account for them.

> Dismax handler not handling +/- correctly
> -----------------------------------------
>
>                 Key: SOLR-2087
>                 URL: https://issues.apache.org/jira/browse/SOLR-2087
>             Project: Solr
>          Issue Type: Bug
>          Components: search
>    Affects Versions: 1.4
>            Reporter: Gabriel Weinberg
>
> If I do a query like: i'm a walking contradiction it matches pf as
> text:"i'm_a a_walking walking contradiction"^2.0, and it matches fine.
> If I do a query like: i'm a +walking contradiction it matches pf as
> text:"i'm_a a_+walking +walking contradiction"^2.0 and doesn't match at all.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
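To make the behaviour described above concrete, here is a minimal Python sketch of why the phrase boost stops matching once a `+` is present, and of what "tokenizing away" the leading +/- characters would change. It is not Solr code: the helper names are hypothetical, tokenization is assumed to be a plain whitespace split, and the shingled terms from the report (`i'm_a`, `a_walking`) are deliberately not modelled.

```python
import re

def naive_phrase_terms(raw_q):
    """Treat the whole raw query as one phrase: split on whitespace only,
    so operator characters stay attached to the terms."""
    return raw_q.split()

def stripped_phrase_terms(raw_q):
    """Same as above, but drop leading + or - characters from each term
    (the assumed effect of the alternate tokenizer suggested above)."""
    terms = (re.sub(r"^[+-]+", "", t) for t in raw_q.split())
    return [t for t in terms if t]

def phrase_matches(phrase_terms, doc_terms):
    """True if phrase_terms occur contiguously and in order in doc_terms."""
    n = len(phrase_terms)
    return n > 0 and any(
        doc_terms[i:i + n] == phrase_terms
        for i in range(len(doc_terms) - n + 1)
    )

doc = "i'm a walking contradiction".split()
q = "i'm a +walking contradiction"

print(phrase_matches(naive_phrase_terms(q), doc))     # False: "+walking" != "walking"
print(phrase_matches(stripped_phrase_terms(q), doc))  # True
```

The trade-off mentioned above still applies: stripping the characters helps this use case, but it would lose the exact-match boost for content that genuinely contains a leading + or -.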
[jira] Commented: (SOLR-2087) Dismax handler not handling +/- correctly
[ https://issues.apache.org/jira/browse/SOLR-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12902126#action_12902126 ]

Gabriel Weinberg commented on SOLR-2087:

Hoss, thanks for clarifying, but I'm still a bit confused. What I had in mind was that this current behavior breaks anything but a blank mm field. That is, if either the user (or I on the backend) mark any fields as mandatory or prohibited, the pf is rendered useless. Is that what you mean by intended behavior?

I will try the tokenizer suggestion though.