[
https://issues.apache.org/jira/browse/OOZIE-2467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15303126#comment-15303126
]
Purshotam Shah commented on OOZIE-2467:
---------------------------------------
Thanks Rohini for review. Committed to trunk.
> Oozie can shutdown itself on long GC pause
> ------------------------------------------
>
> Key: OOZIE-2467
> URL: https://issues.apache.org/jira/browse/OOZIE-2467
> Project: Oozie
> Issue Type: Bug
> Reporter: Purshotam Shah
> Assignee: Purshotam Shah
> Fix For: trunk
>
> Attachments: OOZIE-2467-V1.patch, OOZIE-2467-V2.patch
>
>
> There are two issues.
> 1. Oozie doesn't set ZK session timeout. So, curator uses default session
> timeout which is 1 min.
> If there is GC pause for more than 1 min, or frequent GC pause because of
> which curator can't send heartbeat, the connection will be dropped and Oozie
> will shutdown itself.
> 2. Oozie uses ExponentialBackoffRetry with initial wait = 1sec and max
> retries = 3.
> so Oozie will only retry 3 times, with duration 1sec, 2 sec and 4 sec. We
> need increase max retries count to at least 10.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)