[ https://issues.apache.org/jira/browse/OAK-4844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15522577#comment-15522577 ]
Alex Parvulescu commented on OAK-4844:
--------------------------------------
Test results using the patch: the size is {{13GB}}.
{noformat}
Total size:
13 GB in 56521 data segments
768 KB in 3 bulk segments
3 GB in maps (46450859 leaf and branch records)
1 GB in lists (55469092 list and bucket records)
3 GB in values (value and block records of 70765678 properties,
3429/378684/0/1214419 small/medium/long/external blobs, 46258452/1862224/159
small/medium/long strings)
161 MB in templates (16772712 template records)
2 GB in nodes (251591739 node records)
links to non existing segments: []
{noformat}
> Analyse effects of simplified record ids
> ----------------------------------------
>
> Key: OAK-4844
> URL: https://issues.apache.org/jira/browse/OAK-4844
> Project: Jackrabbit Oak
> Issue Type: Improvement
> Components: segment-tar
> Reporter: Michael Dürig
> Assignee: Michael Dürig
> Labels: performance
> Fix For: Segment Tar 0.0.14
>
> Attachments: OAK-4844.patch
>
>
> OAK-4631 introduced a simplified serialisation for record ids. This causes
> their footprint on disk to increase from 3 bytes to 18 bytes. OAK-4631 has
> some initial analysis on the effect this is having on repositories as a
> whole.
> I'm opening this issue as a dedicated task to further look into mitigation
> strategies (if necessary).
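
To make the 3-byte versus 18-byte figures in the description concrete, here is a minimal Java sketch. Only the two sizes come from the issue text. The byte layouts are assumptions for illustration (a 1-byte segment reference index plus a 2-byte offset for the compact form, and a 16-byte segment UUID plus a 2-byte offset for the simplified form), and the class and method names are hypothetical, not Oak's API.

```java
import java.nio.ByteBuffer;
import java.util.UUID;

// Hypothetical helper, not part of Oak: illustrates the per-record-id footprint.
final class RecordIdFootprint {
    static final int COMPACT_BYTES = 3;     // size before OAK-4631, per the issue description
    static final int SIMPLIFIED_BYTES = 18; // size after OAK-4631, per the issue description

    // Assumed compact layout: 1-byte index into a per-segment reference table + 2-byte offset.
    static byte[] compact(int segmentIndex, int offset) {
        return ByteBuffer.allocate(COMPACT_BYTES)
                .put((byte) segmentIndex)
                .putShort((short) offset)
                .array();
    }

    // Assumed simplified layout: full 16-byte segment UUID + 2-byte offset.
    static byte[] simplified(UUID segmentId, int offset) {
        return ByteBuffer.allocate(SIMPLIFIED_BYTES)
                .putLong(segmentId.getMostSignificantBits())
                .putLong(segmentId.getLeastSignificantBits())
                .putShort((short) offset)
                .array();
    }

    // Additional bytes on disk for a given number of serialised record ids.
    static long extraBytes(long recordIdCount) {
        return recordIdCount * (SIMPLIFIED_BYTES - COMPACT_BYTES);
    }

    public static void main(String[] args) {
        long ids = 1_000_000L; // hypothetical count, not a measured value
        System.out.println(compact(5, 1024).length + " -> "
                + simplified(UUID.randomUUID(), 1024).length + " bytes per record id");
        System.out.println(ids + " record ids cost " + extraBytes(ids) + " extra bytes");
    }
}
```

Each serialised record id grows by 15 bytes, so the effect on a repository depends on how many record ids its segments contain, which the figures in this message do not state.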
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)