Manuel Lenormand created SOLR-6310:
--------------------------------------

             Summary: create a TypeQueryParser to query each token on it's 
matching field type
                 Key: SOLR-6310
                 URL: https://issues.apache.org/jira/browse/SOLR-6310
             Project: Solr
          Issue Type: Improvement
          Components: query parsers
    Affects Versions: 4.9
            Reporter: Manuel Lenormand
            Priority: Minor
             Fix For: 5.0, 4.10


Indexed documents frequently contain different types in different field, e.g 
emails, telephone numbers, ips etc. The fields may have been extracted from the 
content field or originally structured that way.

We should propose a queryParser that recognizes the query token type (eg. 
regex) and implicitly reformulate the query to run against the matching field 
only. That would make a good performance boost in case the query is running on 
a "catch them all" field and a more adapted analyze for the different types.
 It would also avoid the idf drift that occurs on an above "catch them all" 
field.

A workaround could be using the type token filter with the matching type 
whitelist and querying all the different field types with edismax's qf param.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to