[ https://issues.apache.org/jira/browse/ZOOKEEPER-366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12932708#action_12932708 ]
Patrick Hunt commented on ZOOKEEPER-366: ---------------------------------------- FYI, this came up again today on hbase list: 14:59 < _hp_> man this system time update on a bunch of machines causing zookeeper session timeouts causing hr's to die is really taking its toll, count on a table now hangs, i disabled and enabled the table, tried count again, and it hangs at the same place still. Arg. Ben any progress on this? Should we try to get it into 3.3.3? > Session timeout detection can go wrong if the leader system time changes > ------------------------------------------------------------------------ > > Key: ZOOKEEPER-366 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-366 > Project: Zookeeper > Issue Type: Bug > Reporter: Benjamin Reed > Assignee: Benjamin Reed > Attachments: ZOOKEEPER-366.patch > > > the leader tracks session expirations by calculating when a session will > timeout and then periodically checking to see what needs to be timed out > based on the current time. this works great as long as the leaders clock > progresses at a steady pace. the problem comes when there are big (session > size) changes in clock, by ntp for example. if time gets adjusted forward, > all the sessions could timeout immediately. if time goes backward sessions > that should timeout may take a lot longer to actually expire. > this is really just a leader issue. the easiest way to deal with this is to > have the leader relinquish leadership if it detects a big jump forward in > time. when a new leader gets elected, it will recalculate timeouts of active > sessions. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.