[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12932708#action_12932708
 ] 

Patrick Hunt commented on ZOOKEEPER-366:
----------------------------------------

FYI, this came up again today on hbase list:

14:59 < _hp_> man this system time update on a bunch of machines causing 
zookeeper session timeouts causing hr's to die is really taking its toll, count 
on a table now hangs, i disabled and enabled the table, tried count again, and 
it hangs at the same place still.  Arg.


Ben any progress on this? Should we try to get it into 3.3.3?

> Session timeout detection can go wrong if the leader system time changes
> ------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-366
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-366
>             Project: Zookeeper
>          Issue Type: Bug
>            Reporter: Benjamin Reed
>            Assignee: Benjamin Reed
>         Attachments: ZOOKEEPER-366.patch
>
>
> the leader tracks session expirations by calculating when a session will 
> timeout and then periodically checking to see what needs to be timed out 
> based on the current time. this works great as long as the leaders clock 
> progresses at a steady pace. the problem comes when there are big (session 
> size) changes in clock, by ntp for example. if time gets adjusted forward, 
> all the sessions could timeout immediately. if time goes backward sessions 
> that should timeout may take a lot longer to actually expire.
> this is really just a leader issue. the easiest way to deal with this is to 
> have the leader relinquish leadership if it detects a big jump forward in 
> time. when a new leader gets elected, it will recalculate timeouts of active 
> sessions.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to