[ 
https://issues.apache.org/jira/browse/LUCENE-7401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15401345#comment-15401345
 ] 

Michael McCandless commented on LUCENE-7401:
--------------------------------------------

Are you trying to address the adversarial case of indexing e.g. a narrow sliver 
of points?

Another fun one is if all indexed points are equidistant from an origin.  I've 
wondered whether cells should be "shrink wrapped" during indexing to handle 
this one...

There are quite a few papers that explore different splitting techniques to 
have better behavior with challenging cases.

> BKDWriter should ensure all dimensions are indexed
> --------------------------------------------------
>
>                 Key: LUCENE-7401
>                 URL: https://issues.apache.org/jira/browse/LUCENE-7401
>             Project: Lucene - Core
>          Issue Type: Bug
>            Reporter: Adrien Grand
>            Priority: Minor
>
> The current heuristic is to use the dimension that has the largest span, so 
> if dimensions have a different distribution of values, we could theoretically 
> (but maybe in practice too?) end up with one dimension that is not indexed at 
> all and queries that are mostly selective on this dimension would need to 
> scan lots of blocks.
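
A minimal sketch of the largest-span heuristic described in the issue, to show how one dimension can end up never being split. This is not the actual BKDWriter code: the class and method names (SplitDimSketch, pickSplitDim, countSplits) are hypothetical, and the sketch assumes each split halves the cell at the midpoint of its value range and follows only the lower half, whereas the real writer partitions the indexed points themselves.

```java
// Hypothetical illustration only; not taken from Lucene's BKDWriter.
class SplitDimSketch {

    /** Returns the index of the dimension with the largest (max - min) span. */
    static int pickSplitDim(long[] min, long[] max) {
        int best = 0;
        for (int d = 1; d < min.length; d++) {
            if (max[d] - min[d] > max[best] - min[best]) {
                best = d;
            }
        }
        return best;
    }

    /**
     * Follows one root-to-leaf path of the tree for the given number of
     * levels and counts how often each dimension is chosen for a split.
     * Assumption: every split halves the cell at the midpoint of its range
     * and we descend into the lower half.
     */
    static int[] countSplits(long[] min, long[] max, int levels) {
        long[] lo = min.clone();
        long[] hi = max.clone();
        int[] counts = new int[lo.length];
        for (int level = 0; level < levels; level++) {
            int d = pickSplitDim(lo, hi);
            counts[d]++;
            // Shrink the cell to its lower half along the chosen dimension.
            hi[d] = lo[d] + (hi[d] - lo[d]) / 2;
        }
        return counts;
    }
}
```

With one dimension spanning 0..1,000,000 and another spanning 0..10, ten levels of splitting under these assumptions always pick the first dimension, so a query that is selective only on the second dimension gets no help from the tree structure.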


