[ http://issues.apache.org/jira/browse/LUCENE-524?page=all ]
Randy Puttick updated LUCENE-524:
---------------------------------
Attachment: MultiTermQuery.java
MultiTermScorer.java
Implements union operation on a priority queue and scores multi-term based on
maximum over terms versus essentially the average
> Current implementation of fuzzy and wildcard queries inappropriately
> implemented as Boolean query rewrites
> ----------------------------------------------------------------------------------------------------------
>
> Key: LUCENE-524
> URL: http://issues.apache.org/jira/browse/LUCENE-524
> Project: Lucene - Java
> Type: Improvement
> Components: Search
> Versions: 1.9
> Reporter: Randy Puttick
> Attachments: MultiTermQuery.java, MultiTermScorer.java
>
> The implementation of MultiTermQuery in terms of BooleanQuery introduces
> several problems:
> 1) Collisions with maximum clause limit on boolean queries which throws an
> exception. This is most problematic because it is difficult to ascertain in
> advance how many terms a fuzzy query or wildcard query might involve.
> 2) The boolean disjunctive scoring is not appropriate for either fuzzy or
> wildcard queries. In effect the score is divided by the number of terms in
> the query which has nothing to do with the relevancy of the results.
> 3) Performance of disjunctive boolean queries for large term sets is quite
> sub-optimal
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]