[
https://issues.apache.org/jira/browse/OAK-4201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Dürig updated OAK-4201:
-------------------------------
Component/s: (was: segmentmk)
segment-next
> Add an index of binary references in a tar file
> -----------------------------------------------
>
> Key: OAK-4201
> URL: https://issues.apache.org/jira/browse/OAK-4201
> Project: Jackrabbit Oak
> Issue Type: Improvement
> Components: segment-next
> Reporter: Chetan Mehrotra
> Fix For: 1.6
>
>
> Currently for Blob GC in case of segment {{SegmentBlobReferenceRetriever}}
> goes through all tar files and extracts the binary references. This has 2
> issues
> # Logic has go through all the segments in all tar files
> # All segments get loaded in memory once which would affect normal system
> performance
> This process can be optimized if we also write a file entry in tar (similar
> to gph i.e. graph and idx i.e. index files) which has entries of all binary
> references referred to in any segment present in that tar file. Then GC logic
> would just have read this file and avoid scanning all the segments
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)