[ https://issues.apache.org/jira/browse/HDDS-3459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
runzhiwang updated HDDS-3459:
-----------------------------
Description:
*What's the problem?*
There are 3 datanodes in a group: leader, follower1, follower2. Steps to
reproduce the problem are as follows:
1. follower2 reports close pipeline.
2. SCM sends the close pipeline command.
3. The leader and follower1 remove the group, but follower2 hits a socket
timeout and does not remove the group.
4. follower2 then runs LeaderElection over and over for at least 6 hours,
while the leader and follower1 respond that the group is not found.
The screenshots below show each step.
1. follower2 reports close pipeline:
!screenshot-8.png!
2. SCM closes the pipeline:
!screenshot-9.png!
!screenshot-10.png!
3. The leader removes the group:
!screenshot-11.png!
follower1 removes the group:
!screenshot-12.png!
follower2 hits a socket timeout:
!screenshot-13.png!
4. follower2 then runs LeaderElection repeatedly for at least 6 hours:
!screenshot-14.png!
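The sequence above can be sketched as a toy model. This is only an illustration of why the election cannot finish, not the Ozone or Ratis code: every class, method, and the group id "pipeline-1" below are hypothetical names made up for the sketch, and the socket timeout is modeled as a close command that is simply never delivered.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Toy model only: none of these types are Ozone or Ratis classes.
class StaleGroupSketch {
    enum VoteReply { GRANTED, GROUP_NOT_FOUND }

    static final class Node {
        final String name;
        final Set<String> groups = new HashSet<>();

        Node(String name, String group) {
            this.name = name;
            groups.add(group);
        }

        // delivered == false models the lost command (socket timeout).
        boolean handleClosePipeline(String group, boolean delivered) {
            if (delivered) {
                groups.remove(group);
            }
            return delivered;
        }

        // A node that already removed the group cannot grant a vote for it.
        VoteReply handleRequestVote(String group) {
            return groups.contains(group) ? VoteReply.GRANTED : VoteReply.GROUP_NOT_FOUND;
        }
    }

    // Runs `rounds` election attempts and returns how many reached a majority.
    static int runElections(Node candidate, List<Node> peers, String group, int rounds) {
        if (!candidate.groups.contains(group)) {
            return 0; // a node that removed the group has nothing to elect
        }
        int majority = (peers.size() + 1) / 2 + 1;
        int won = 0;
        for (int i = 0; i < rounds; i++) {
            int votes = 1; // the candidate votes for itself
            for (Node peer : peers) {
                if (peer.handleRequestVote(group) == VoteReply.GRANTED) {
                    votes++;
                }
            }
            if (votes >= majority) {
                won++;
            }
        }
        return won;
    }

    // Steps 1-4 of the report: two nodes remove the group, follower2 does not.
    static int scenario(int rounds) {
        String group = "pipeline-1"; // hypothetical group id
        Node leader = new Node("leader", group);
        Node follower1 = new Node("follower1", group);
        Node follower2 = new Node("follower2", group);
        leader.handleClosePipeline(group, true);
        follower1.handleClosePipeline(group, true);
        follower2.handleClosePipeline(group, false); // socket timeout
        return runElections(follower2, List.of(leader, follower1), group, rounds);
    }

    public static void main(String[] args) {
        System.out.println("elections won: " + scenario(5) + " of 5");
    }
}
```

In this model follower2 only ever collects its own vote, so no round reaches the majority of 2 out of 3, however many rounds are run. The sketch bounds the number of rounds; in the reported case nothing stops the retries, which matches the 6 hours of LeaderElection seen in the logs.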
was:
*What's the problem ?*
There are 3 datanodes in a group: leader, follower1, follower2. Steps to
reproduce the problem are as following:
1. follower2 report close pipeline
2. scm send close pipeline command
3. leader and follower1 remove group, but follower2 socket timeout and does not
remove group
4. follower2 then begin infinite LeaderElection at least 6 hours, leader and
follower1 response group not found
You can see find it in following screenshot.
1. follower2 report close pipeline
!screenshot-8.png!
2. Scm close pipeline:
!screenshot-9.png!
!screenshot-10.png!
3. leader remove group
!screenshot-11.png!
follower1 remove group
!screenshot-12.png!
follower2 socket timeout
!screenshot-13.png!
4. follower2 then begin infinite LeaderElection at least 6 hours
!screenshot-14.png!
> Infinite leader election
> ------------------------
>
> Key: HDDS-3459
> URL: https://issues.apache.org/jira/browse/HDDS-3459
> Project: Hadoop Distributed Data Store
> Issue Type: Bug
> Reporter: runzhiwang
> Assignee: runzhiwang
> Priority: Major
> Attachments: screenshot-1.png, screenshot-10.png, screenshot-11.png,
> screenshot-12.png, screenshot-13.png, screenshot-14.png, screenshot-2.png,
> screenshot-3.png, screenshot-4.png, screenshot-5.png, screenshot-6.png,
> screenshot-7.png, screenshot-8.png, screenshot-9.png