[ https://issues.apache.org/jira/browse/LUCENE-7401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15401345#comment-15401345 ]
Michael McCandless commented on LUCENE-7401: -------------------------------------------- Are you trying to address the adersarial case of indexing e.g. a narrow sliver of points? Another fun one is if all indexed points are equidistant from an origin. I've wondered whether cells should be "shrink wrapped" during indexing to handle this one... There are quite a few papers that explore different splitting techniques to have better behavior with challenging cases. > BKDWriter should ensure all dimensions are indexed > -------------------------------------------------- > > Key: LUCENE-7401 > URL: https://issues.apache.org/jira/browse/LUCENE-7401 > Project: Lucene - Core > Issue Type: Bug > Reporter: Adrien Grand > Priority: Minor > > The current heuristic is to use the dimension that has the largest span, so > if dimensions have a different distribution of values, we could theoretically > (but maybe in practice too?) end up with one dimension that is not indexed at > all and queries that are mostly selective on this dimension would need to > scan lots of blocks. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org