[
https://issues.apache.org/jira/browse/OAK-1732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marcel Reutegger updated OAK-1732:
----------------------------------
Fix Version/s: 1.0
Merged into 1.0 branch in http://svn.apache.org/r1587560
> Cluster node lease not renewed in time
> --------------------------------------
>
> Key: OAK-1732
> URL: https://issues.apache.org/jira/browse/OAK-1732
> Project: Jackrabbit Oak
> Issue Type: Bug
> Components: core, mongomk
> Affects Versions: 0.20.0
> Reporter: Marcel Reutegger
> Assignee: Marcel Reutegger
> Priority: Blocker
> Fix For: 1.0, 1.1
>
>
> A cluster node lease is renewed periodically to indicate that a given
> clusterId is in use. This happens twice the async delay before the lease
> expires. With the default configuration this means the lease is renewed two
> seconds before it expires. Load test with an oak cluster shows that the lease
> sometimes is not renewed in time and the LastRevRecoveryAgent kicks in even
> though the cluster nodes are still running.
> I see a couple of options to fix this:
> - Renew the lease earlier. E.g. after half of the lease timeout, or even with
> every background operation run. This would be roughly once a second.
> - Decouple the lease renew from the background operation thread where also
> more expensive operations are executed (_lastRev updates and cache
> invalidation).
--
This message was sent by Atlassian JIRA
(v6.2#6252)