[ https://issues.apache.org/jira/browse/LUCENE-8562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16694461#comment-16694461 ]
Adrien Grand commented on LUCENE-8562:
--------------------------------------
bq. I think for common prefix to work you need all dimensions to be computed
but not 100% sure
Right, computing common prefixes is important. I was only considering removing
the logic that computes the cardinality of the leading uncommon byte and
potentially selects a data dimension as the "sortedDim", since this adds some
tricky code.
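
For readers following along, the two pieces of logic discussed above can be
sketched roughly as below. This is a minimal illustration, not the BKDWriter
code: the class and method names (LeafPrefixSketch, commonPrefixLength,
leadingByteCardinality, pickSortedDim) are hypothetical, and it assumes each
point in a leaf is a packed byte[] of numDims * bytesPerDim bytes. The first
method is the common-prefix computation that has to cover all dimensions; the
other two are the cardinality-based "sortedDim" selection.

```java
class LeafPrefixSketch {

    // Number of leading bytes of dimension `dim` shared by every value in the leaf.
    static int commonPrefixLength(byte[][] values, int dim, int bytesPerDim) {
        int offset = dim * bytesPerDim;
        int prefix = bytesPerDim;
        for (int i = 1; i < values.length && prefix > 0; i++) {
            int j = 0;
            while (j < prefix && values[i][offset + j] == values[0][offset + j]) {
                j++;
            }
            prefix = j; // the shared prefix can only shrink
        }
        return prefix;
    }

    // Number of distinct values taken by the first byte after the common prefix.
    // Returns 0 when the whole dimension is identical across the leaf.
    static int leadingByteCardinality(byte[][] values, int dim, int bytesPerDim, int prefix) {
        if (prefix == bytesPerDim) {
            return 0;
        }
        boolean[] seen = new boolean[256];
        int cardinality = 0;
        int pos = dim * bytesPerDim + prefix;
        for (byte[] value : values) {
            int b = value[pos] & 0xFF; // treat the byte as unsigned
            if (!seen[b]) {
                seen[b] = true;
                cardinality++;
            }
        }
        return cardinality;
    }

    // Picks the dimension whose leading uncommon byte has the fewest distinct
    // values, skipping dimensions that are fully shared. Defaults to 0.
    static int pickSortedDim(byte[][] values, int numDims, int bytesPerDim) {
        int sortedDim = 0;
        int best = Integer.MAX_VALUE;
        for (int dim = 0; dim < numDims; dim++) {
            int prefix = commonPrefixLength(values, dim, bytesPerDim);
            if (prefix == bytesPerDim) {
                continue;
            }
            int cardinality = leadingByteCardinality(values, dim, bytesPerDim, prefix);
            if (cardinality < best) {
                best = cardinality;
                sortedDim = dim;
            }
        }
        return sortedDim;
    }

    public static void main(String[] args) {
        // Three points, two dimensions of two bytes each (made-up data).
        byte[][] leaf = {
            {1, 2, 5, 0},
            {1, 3, 5, 1},
            {1, 4, 5, 0},
        };
        for (int dim = 0; dim < 2; dim++) {
            System.out.println("dim " + dim + " prefix=" + commonPrefixLength(leaf, dim, 2));
        }
        System.out.println("sortedDim=" + pickSortedDim(leaf, 2, 2));
    }
}
```

In this sketch the prefix computation stands on its own, so dropping the
cardinality step would only mean replacing pickSortedDim with a fixed choice.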
> Speed up merging segments of points with data dimensions
> --------------------------------------------------------
>
> Key: LUCENE-8562
> URL: https://issues.apache.org/jira/browse/LUCENE-8562
> Project: Lucene - Core
> Issue Type: Improvement
> Components: core/index
> Affects Versions: master (8.0), 7.7
> Reporter: Ignacio Vera
> Priority: Major
> Attachments: LUCENE-8562.patch, LUCENE-8562.patch
>
>
> Currently when merging segments of points with data dimensions, all
> dimensions are sorted and carried over down the tree even though only
> indexing dimensions are needed to build the BKD tree. This is needed so leaf
> node data can be compressed by common prefix.
> But when using _MutablePointValues_, this ordering is done at the leaf level,
> so we can use a similar approach for data dimensions and delay the sorting to
> the leaf level. This seems to speed up indexing time as well as reduce the
> storage needed for building the index.
>
>
>