[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-6989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042870#comment-14042870
 ] 

ASF subversion and git services commented on CLOUDSTACK-6989:
-------------------------------------------------------------

Commit a6381f5809253c4fe2a2a6ac932a4d8861de8de2 in cloudstack's branch 
refs/heads/4.4-forward from [~yasker]
[ https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;h=a6381f5 ]

CLOUDSTACK-6989: Add 3 strikes rule for RvR freezing detection

Sometime in VR ntpd would move time backward to keep sync with NTP server, which
can result in false alarm of keepalived monitering process.

This patch adds 3 strikes for keepalived process dead detection to avoid falsely
shutdown keepalived process due to time adjustment for only once.


> Redunant virtual router(RvR): When time synced by ntpd, RvR may result in 
> FAULT state
> -------------------------------------------------------------------------------------
>
>                 Key: CLOUDSTACK-6989
>                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-6989
>             Project: CloudStack
>          Issue Type: Bug
>      Security Level: Public(Anyone can view this level - this is the 
> default.) 
>          Components: Virtual Router
>    Affects Versions: 4.3.0, 4.4.0
>            Reporter: Sheng Yang
>            Assignee: Sheng Yang
>            Priority: Critical
>             Fix For: 4.4.0
>
>
> ntpd in VR can change time backward to sync with remote NTP server. But the 
> check_heartbeat.sh for keepalived process in RvR depends on time to know if 
> keepalived keeps writing to heartbeat file. ntpd rewinds the time result in 
> check_heartbeat.sh misjudge the situation thus stop the keepalived process.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to