Alex Parvulescu created OAK-3168:
------------------------------------
Summary: SegmentCache flushes Segment on update
Key: OAK-3168
URL: https://issues.apache.org/jira/browse/OAK-3168
Project: Jackrabbit Oak
Issue Type: Bug
Components: segmentmk
Reporter: Alex Parvulescu
Assignee: Alex Parvulescu
The SegmentCache currently uses the cache eviction callback to drop the
in-memory reference to the Segment instance, which helps keep the cache's
memory requirements under control [0].
What I've noticed, though, is that a cache update (a put for an existing key)
also triggers an eviction call. This puts a lot of extra IO pressure on the
SegmentStore, which is not only unable to cache the segment, but is also
forced to reload it multiple times because the reference gets nullified after
each load.
This comes from the sampling behavior of the SegmentId: it does not hit the
cache each time it needs to load a new Segment, but rather loads it from IO
and (re)places it in the cache, based on a sampling rate [1].
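To illustrate the effect, here is a minimal, self-contained Java sketch. It is
not the Oak code: ToyCache, Id, SAMPLE_RATE and loadsAfter are hypothetical
names, and the sampling rate of 4 as well as the "set the reference, then put
into the cache" ordering are assumptions made for the example only.

```java
import java.util.HashMap;
import java.util.Map;

public class EvictOnUpdateSketch {

    /** Toy cache. With notifyOnUpdate=true, replacing an existing key runs the eviction callback. */
    static final class ToyCache {
        private final Map<Id, String> map = new HashMap<>();
        private final boolean notifyOnUpdate;

        ToyCache(boolean notifyOnUpdate) {
            this.notifyOnUpdate = notifyOnUpdate;
        }

        void put(Id id, String segment) {
            String old = map.put(id, segment);
            if (old != null && notifyOnUpdate) {
                // Same effect as the eviction callback: drop the in-memory reference.
                id.segment = null;
            }
        }
    }

    /** Hypothetical stand-in for a SegmentId that keeps a direct reference to its segment. */
    static final class Id {
        static final int SAMPLE_RATE = 4; // assumed value, for illustration only
        String segment;                   // null means "reload from IO on next access"
        int accesses;
        int loads;                        // number of simulated IO loads

        String getSegment(ToyCache cache) {
            String s = segment;
            if (s == null) {
                s = "segment-data";       // simulated read from the store
                loads++;
                segment = s;
                cache.put(this, s);       // on an existing key this nullifies 'segment' again
            } else if (++accesses % SAMPLE_RATE == 0) {
                cache.put(this, s);       // sampled (re)placement into the cache
            }
            return s;
        }
    }

    /** Returns how many simulated IO loads happen for n accesses to one segment. */
    static int loadsAfter(int n, boolean notifyOnUpdate) {
        ToyCache cache = new ToyCache(notifyOnUpdate);
        Id id = new Id();
        for (int i = 0; i < n; i++) {
            id.getSegment(cache);
        }
        return id.loads;
    }

    public static void main(String[] args) {
        System.out.println("callback on update: " + loadsAfter(10, true) + " loads");
        System.out.println("no callback on update: " + loadsAfter(10, false) + " loads");
    }
}
```

Under these assumptions, 10 accesses cost 6 simulated loads when the callback
fires on updates, and 1 load when it does not: after the first sampled
(re)placement, every access reloads the segment.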
I see 2 options:
* change the cache code to _not_ call the eviction callback on updates (or
allow disabling this call on updates)
* change the SegmentTracker code to add the value to the cache only if it's
not already there; Segments are immutable, so no harm is done.
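A rough sketch of how the two options differ, using a toy cache rather than
the real Lirs cache API. CallbackCache, the callbackOnUpdate flag, evict and
callbacksFor are hypothetical names introduced for this example.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.BiConsumer;

public class FixOptionsSketch {

    /** Toy cache with an eviction callback; not the real cache implementation. */
    static final class CallbackCache<K, V> {
        private final Map<K, V> map = new HashMap<>();
        private final BiConsumer<K, V> onEvict;
        private final boolean callbackOnUpdate;

        CallbackCache(BiConsumer<K, V> onEvict, boolean callbackOnUpdate) {
            this.onEvict = onEvict;
            this.callbackOnUpdate = callbackOnUpdate;
        }

        /** Option 1: with callbackOnUpdate=false, a replace does not notify. */
        void put(K key, V value) {
            V old = map.put(key, value);
            if (old != null && callbackOnUpdate) {
                onEvict.accept(key, old);
            }
        }

        /** Option 2: the caller only adds missing entries, so nothing is ever replaced. */
        void putIfAbsent(K key, V value) {
            map.putIfAbsent(key, value);
        }

        /** A real eviction always notifies. */
        void evict(K key) {
            V old = map.remove(key);
            if (old != null) {
                onEvict.accept(key, old);
            }
        }
    }

    /** Counts callback invocations for three puts of the same key followed by one eviction. */
    static int callbacksFor(boolean callbackOnUpdate, boolean onlyIfAbsent) {
        int[] calls = {0};
        CallbackCache<String, String> cache =
                new CallbackCache<>((k, v) -> calls[0]++, callbackOnUpdate);
        for (int i = 0; i < 3; i++) {
            if (onlyIfAbsent) {
                cache.putIfAbsent("id", "segment");
            } else {
                cache.put("id", "segment");
            }
        }
        cache.evict("id");
        return calls[0];
    }

    public static void main(String[] args) {
        System.out.println("current behavior: " + callbacksFor(true, false));
        System.out.println("option 1: " + callbacksFor(false, false));
        System.out.println("option 2: " + callbacksFor(true, true));
    }
}
```

In this toy model both options reduce the callback count to the single real
eviction; they differ only in whether the change lives in the cache or in the
caller.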
I first raised this issue offline with [~tmueller] and [~mduerig]. As I
understand it, [~mduerig] is in favor of option one, while [~tmueller]
proposed that the Lirs cache impl should be in line with what the guava cache
does, and depending on that we could choose the right fix here.
Hope this covers everything.
[0]
https://github.com/apache/jackrabbit-oak/blob/trunk/oak-core/src/main/java/org/apache/jackrabbit/oak/plugins/segment/SegmentTracker.java#L133
[1]
https://github.com/apache/jackrabbit-oak/blob/trunk/oak-core/src/main/java/org/apache/jackrabbit/oak/plugins/segment/SegmentId.java#L135