[ https://issues.apache.org/jira/browse/OAK-6333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chetan Mehrotra updated OAK-6333: --------------------------------- Labels: (was: candidate_oak_1_4 candidate_oak_1_6) Fix Version/s: 1.4.18 1.6.4 Merged to * 1.6 - 1803762 * 1.4 - 1803776 For older branches this needs to be explicitly enabled by setting system property {{oak.lucene.useActualEntryCount}} to {{true}} > IndexPlanner should use actual entryCount instead of limiting it to 1000 > ------------------------------------------------------------------------ > > Key: OAK-6333 > URL: https://issues.apache.org/jira/browse/OAK-6333 > Project: Jackrabbit Oak > Issue Type: Improvement > Components: lucene > Reporter: Chetan Mehrotra > Assignee: Chetan Mehrotra > Fix For: 1.8, 1.7.4, 1.6.4, 1.4.18 > > > Currently IndexPlanner uses following logic for estimating the entryCount > # If the index has fulltext indexing enable then and query has a fulltext > constraint clause specified > ## If {{entryCount}} value is defined then min(entryCount, numOfDocs) > ## If not then use the {{numDocs}} i.e. actual entry count > # If the index is pure property index i.e. none of the property definitions > have {{analyzed}} set to true > ## If {{entryCount}} value is defined then min(entryCount, numOfDocs) > ## Else Take min(1000, numDocs) > Revisiting the logic for #2 it appears in 1.0.x days (OAK-2200) we capped it > to 1000 because cost estimation for property indexes was inaccurate (they > used to report low values causing lucene index to loose). > With support for Counters the cost estimation for property index has improved > and now we should remove this capping and let it use numDocs. > One area where it causes issue is when we have two indexes where one is > superset of other. For e.g. /oak:index/asset and /content/en/ > /oak:index/asset where both have some matching properties. Logically if query > can be handled by sub index then it should get picked but currently either of > them can be picked making query plan undeterministic -- This message was sent by Atlassian JIRA (v6.4.14#64029)