[
https://issues.apache.org/jira/browse/HDDS-3459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
runzhiwang updated HDDS-3459:
-----------------------------
Description:
*What's the problem ?*
There are 3 datanodes in a group: leader, follower1, follower2.
1. follower2 report close pipeline
2. scm send close pipeline command
3. leader and follower1 remove group, but follower2 socket timeout and does not
remove group
4. follower2 then begin infinite LeaderElection at least 6 hours, leader and
follower1 response group not found
You can see find it in following screenshot.
1. follower2 report close pipeline
!screenshot-8.png!
2. Scm close pipeline:
!screenshot-9.png!
!screenshot-10.png!
3. leader remove group
!screenshot-11.png!
follower1 remove group
!screenshot-12.png!
follower2 socket timeout
!screenshot-13.png!
4. follower2 then begin infinite LeaderElection at least 6 hours
!screenshot-14.png!
was:
*What's the problem ?*
There are 3 datanodes in a group: leader, follower1, follower2, then I find the
follower1 and leader both remove a group, but follower2 does not remove the
group. Then follower2 begin infinite LeaderElection, but the leader and
follower1 response group not found.
Follow2 report close pipeline:
!screenshot-7.png!
Scm close pipeline:
!screenshot-4.png!
!screenshot-5.png!
Leader remove group:
!screenshot-2.png!
Follower1 remove group:
!screenshot-3.png!
Follower2 cannot connect to scm:
!screenshot-6.png!
Follower2 infinite LeaderElection:
!screenshot-1.png!
> Infinite group not found
> ------------------------
>
> Key: HDDS-3459
> URL: https://issues.apache.org/jira/browse/HDDS-3459
> Project: Hadoop Distributed Data Store
> Issue Type: Bug
> Reporter: runzhiwang
> Assignee: runzhiwang
> Priority: Major
> Attachments: screenshot-1.png, screenshot-10.png, screenshot-11.png,
> screenshot-12.png, screenshot-13.png, screenshot-14.png, screenshot-2.png,
> screenshot-3.png, screenshot-4.png, screenshot-5.png, screenshot-6.png,
> screenshot-7.png, screenshot-8.png, screenshot-9.png
>
>
> *What's the problem ?*
> There are 3 datanodes in a group: leader, follower1, follower2.
> 1. follower2 report close pipeline
> 2. scm send close pipeline command
> 3. leader and follower1 remove group, but follower2 socket timeout and does
> not remove group
> 4. follower2 then begin infinite LeaderElection at least 6 hours, leader and
> follower1 response group not found
> You can see find it in following screenshot.
> 1. follower2 report close pipeline
> !screenshot-8.png!
> 2. Scm close pipeline:
> !screenshot-9.png!
> !screenshot-10.png!
> 3. leader remove group
> !screenshot-11.png!
> follower1 remove group
> !screenshot-12.png!
> follower2 socket timeout
> !screenshot-13.png!
> 4. follower2 then begin infinite LeaderElection at least 6 hours
> !screenshot-14.png!
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]