[ 
https://issues.apache.org/jira/browse/LUCENE-8562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16692087#comment-16692087
 ] 

Adrien Grand commented on LUCENE-8562:
--------------------------------------

+1 to not sort data dimensions. Do we really need the call to {{PathSlice 
dataSlice = switchToHeap(slices[0], toCloseHeroically)}}, it should already 
have been called earlier? For simplicity maybe we should only compute the 
common prefix length on data dimensions and not even try to use data dimensions 
as a "sortedDim"?



> Speed up merging segments of points with data dimensions
> --------------------------------------------------------
>
>                 Key: LUCENE-8562
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8562
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/index
>    Affects Versions: master (8.0), 7.7
>            Reporter: Ignacio Vera
>            Priority: Major
>         Attachments: LUCENE-8562.patch
>
>
> Currently when merging segments of points with data dimensions, all 
> dimensions are sorted and carried over down the tree even though only 
> indexing dimensions are needed to build the BKD tree. This is needed so leaf 
> node data can be compressed by common prefix.
> But when using _MutablePointValues_, this ordering is done at the leaf level 
> so we can se a similar approach from data dimensions and delay the sorting at 
> leaf level. This seems to speed up indexing time as well as reduce the 
> storage needed for building the index.
>  
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to