[ http://issues.apache.org/jira/browse/LUCENE-524?page=all ]
Randy Puttick updated LUCENE-524: --------------------------------- Attachment: MultiTermQuery.java MultiTermScorer.java Implements union operation on a priority queue and scores multi-term based on maximum over terms versus essentially the average > Current implementation of fuzzy and wildcard queries inappropriately > implemented as Boolean query rewrites > ---------------------------------------------------------------------------------------------------------- > > Key: LUCENE-524 > URL: http://issues.apache.org/jira/browse/LUCENE-524 > Project: Lucene - Java > Type: Improvement > Components: Search > Versions: 1.9 > Reporter: Randy Puttick > Attachments: MultiTermQuery.java, MultiTermScorer.java > > The implementation of MultiTermQuery in terms of BooleanQuery introduces > several problems: > 1) Collisions with maximum clause limit on boolean queries which throws an > exception. This is most problematic because it is difficult to ascertain in > advance how many terms a fuzzy query or wildcard query might involve. > 2) The boolean disjunctive scoring is not appropriate for either fuzzy or > wildcard queries. In effect the score is divided by the number of terms in > the query which has nothing to do with the relevancy of the results. > 3) Performance of disjunctive boolean queries for large term sets is quite > sub-optimal -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]