[ http://issues.apache.org/jira/browse/LUCENE-524?page=all ]

Randy Puttick updated LUCENE-524:
---------------------------------

    Attachment: MultiTermQuery.java
                MultiTermScorer.java

Implements the union operation with a priority queue and scores a multi-term
match by the maximum over its terms rather than, in effect, the average.
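
For readers without the attachments at hand, the idea can be sketched roughly
as follows. This is not the attached MultiTermScorer.java: the names
MaxUnionSketch, Postings and Hit are hypothetical, and each term is assumed to
supply an ascending list of document ids with a per-document score. A priority
queue keyed on the current document id merges the lists, and each document
keeps the largest score any of its matching terms produced.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.PriorityQueue;

// Hypothetical illustration only; not the attached MultiTermScorer.java.
final class MaxUnionSketch {

    /** One term's postings: ascending doc ids, each with a per-document score. */
    static final class Postings {
        final int[] docs;
        final float[] scores;
        int pos = 0;

        Postings(int[] docs, float[] scores) {
            this.docs = docs;
            this.scores = scores;
        }

        int doc() { return docs[pos]; }
        float score() { return scores[pos]; }
        boolean advance() { return ++pos < docs.length; }
    }

    /** A document in the union together with its best per-term score. */
    static final class Hit {
        final int doc;
        final float score;

        Hit(int doc, float score) {
            this.doc = doc;
            this.score = score;
        }
    }

    /** Merges all postings in doc-id order, keeping the maximum score per document. */
    static List<Hit> union(List<Postings> terms) {
        // The queue is ordered by the document each term is currently positioned on.
        PriorityQueue<Postings> queue =
                new PriorityQueue<>((a, b) -> Integer.compare(a.doc(), b.doc()));
        for (Postings p : terms) {
            if (p.docs.length > 0) {
                queue.add(p);
            }
        }
        List<Hit> hits = new ArrayList<>();
        while (!queue.isEmpty()) {
            int doc = queue.peek().doc();
            float best = Float.NEGATIVE_INFINITY;
            // Drain every term that is positioned on this document.
            while (!queue.isEmpty() && queue.peek().doc() == doc) {
                Postings p = queue.poll();
                best = Math.max(best, p.score());
                if (p.advance()) {
                    queue.add(p); // re-insert at the term's next document
                }
            }
            hits.add(new Hit(doc, best));
        }
        return hits;
    }
}
```

Because the queue only ever holds one entry per term, no clause objects are
built per term, and a document matched by several terms is emitted once with
its best score rather than a combined one.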

> Current implementation of fuzzy and wildcard queries inappropriately 
> implemented as Boolean query rewrites
> ----------------------------------------------------------------------------------------------------------
>
>          Key: LUCENE-524
>          URL: http://issues.apache.org/jira/browse/LUCENE-524
>      Project: Lucene - Java
>         Type: Improvement
>   Components: Search
>     Versions: 1.9
>     Reporter: Randy Puttick
>  Attachments: MultiTermQuery.java, MultiTermScorer.java
>
> The implementation of MultiTermQuery in terms of BooleanQuery introduces 
> several problems:
> 1) It collides with the maximum clause limit on boolean queries, which throws
> an exception.  This is the most problematic issue because it is difficult to
> ascertain in advance how many terms a fuzzy or wildcard query might involve.
> 2) Boolean disjunctive scoring is not appropriate for either fuzzy or
> wildcard queries.  In effect, the score is divided by the number of terms in
> the query, which has nothing to do with the relevancy of the results.
> 3) Performance of disjunctive boolean queries over large term sets is quite
> sub-optimal.
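
The scoring concern in point 2 can be illustrated with a toy calculation. The
sketch below is an assumption made to mirror the description, not Lucene's
actual scoring code: ScoreDilutionSketch and both of its methods are
hypothetical, and the matched/expanded factor simply stands in for "divided by
the number of terms in the query".

```java
// Toy model only; the formulas are assumptions, not taken from Lucene's source.
final class ScoreDilutionSketch {

    /**
     * Sum of the matched term scores, scaled by the fraction of expanded
     * terms that matched. A stand-in for disjunctive scoring as described.
     */
    static float disjunctive(float[] matchedScores, int expandedTerms) {
        float sum = 0f;
        for (float s : matchedScores) {
            sum += s;
        }
        return sum * matchedScores.length / expandedTerms;
    }

    /** Best single-term score; independent of how many terms the query expanded to. */
    static float maxOverTerms(float[] matchedScores) {
        float best = 0f;
        for (float s : matchedScores) {
            best = Math.max(best, s);
        }
        return best;
    }
}
```

Under this model, a document that matches exactly one expanded term with score
1.0 scores 1.0 when the pattern expands to one term but only 0.01 when it
expands to a hundred, while the maximum-over-terms score stays at 1.0 in both
cases.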
