[
https://issues.apache.org/jira/browse/LUCENE-8619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Adrien Grand resolved LUCENE-8619.
----------------------------------
Resolution: Not A Problem
This isn't a problem anymore now that Ignacio rewrote the merging of BKD trees
as a selection problem rather than a sorting problem.
> Decrease I/O pressure of OfflineSorter
> --------------------------------------
>
> Key: LUCENE-8619
> URL: https://issues.apache.org/jira/browse/LUCENE-8619
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Adrien Grand
> Priority: Minor
>
> OfflineSorter is likely I/O bound, yet it doesn't really try to relieve I/O.
> For instance it always writes the length on 2 bytes, which is wasteful when
> used by BKDWriter since all byte[] arrays have exactly the same length. For
> LatLonPoint, this is a 25% space overhead that we could remove.
> Doing lightweight compression on the fly might also help.
> As a data point, Ignacio told me that after indexing 60M shapes with
> LatLonShape (1.65B triangles), the index directory was about 265GB and
> dropped to 57GB when merging was over.
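
To make the 25% figure concrete, here is a minimal sketch of the two record
framings. It is not OfflineSorter's actual code: the class and method names are
hypothetical, and it assumes each LatLonPoint value is 8 bytes (two 4-byte
dimensions), so a 2-byte length prefix adds 2/8 = 25%.

```java
import java.io.ByteArrayOutputStream;

// Hypothetical illustration only; these names are not part of Lucene.
public class RecordFraming {

    // Writes each record as a 2-byte big-endian length followed by its bytes.
    static byte[] writeLengthPrefixed(byte[][] records) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (byte[] rec : records) {
            if (rec.length > 0xFFFF) {
                throw new IllegalArgumentException("record too long for a 2-byte length");
            }
            out.write((rec.length >>> 8) & 0xFF);
            out.write(rec.length & 0xFF);
            out.write(rec, 0, rec.length);
        }
        return out.toByteArray();
    }

    // Writes records back to back, relying on a length known up front.
    static byte[] writeFixedLength(byte[][] records, int recordLength) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (byte[] rec : records) {
            if (rec.length != recordLength) {
                throw new IllegalArgumentException("all records must have the same length");
            }
            out.write(rec, 0, rec.length);
        }
        return out.toByteArray();
    }

    // Extra space used by the prefixed framing, as a percentage of the fixed framing.
    static int overheadPercent(int prefixedSize, int fixedSize) {
        return 100 * (prefixedSize - fixedSize) / fixedSize;
    }

    public static void main(String[] args) {
        // Assumption: 8-byte values, as for a 2D point with 4 bytes per dimension.
        byte[][] records = new byte[1000][8];
        int prefixed = writeLengthPrefixed(records).length;
        int fixed = writeFixedLength(records, 8).length;
        System.out.println("prefixed=" + prefixed + " fixed=" + fixed
            + " overhead=" + overheadPercent(prefixed, fixed) + "%");
    }
}
```

With 1000 records of 8 bytes each, the prefixed framing takes 10000 bytes
against 8000 for the fixed-length one. Wider records would see a
proportionally smaller overhead.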