[ 
https://issues.apache.org/jira/browse/HDDS-3459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

runzhiwang updated HDDS-3459:
-----------------------------
    Description: 
*What's the problem ?*
There are 3 datanodes in a group: leader, follower1, follower2.
1. follower2 report close pipeline
2. scm send close pipeline command
3. leader and follower1 remove group, but follower2 socket timeout and does not 
remove group
4.  follower2 then begin infinite LeaderElection at least 6 hours, leader and 
follower1 response group not found

You can see find it in following screenshot.
1. follower2 report close pipeline
 !screenshot-8.png! 
2. Scm close pipeline:
 !screenshot-9.png! 
 !screenshot-10.png! 
3. leader remove group
 !screenshot-11.png! 

   follower1 remove group
 !screenshot-12.png! 

 follower2 socket timeout
 !screenshot-13.png! 

4. follower2 then begin infinite LeaderElection at least 6 hours
 !screenshot-14.png! 

  was:
*What's the problem ?*
There are 3 datanodes in a group: leader, follower1, follower2, then I find the 
follower1 and leader both remove a group, but follower2 does not remove the 
group. Then follower2 begin infinite LeaderElection, but the leader and 
follower1 response group not found. 

Follow2 report close pipeline:
 !screenshot-7.png! 
Scm close pipeline:
 !screenshot-4.png! 
 !screenshot-5.png! 
Leader remove group:

 !screenshot-2.png! 
Follower1 remove group:
 !screenshot-3.png! 

Follower2 cannot connect to scm:

 !screenshot-6.png! 
Follower2 infinite LeaderElection:
 !screenshot-1.png! 


> Infinite group not found
> ------------------------
>
>                 Key: HDDS-3459
>                 URL: https://issues.apache.org/jira/browse/HDDS-3459
>             Project: Hadoop Distributed Data Store
>          Issue Type: Bug
>            Reporter: runzhiwang
>            Assignee: runzhiwang
>            Priority: Major
>         Attachments: screenshot-1.png, screenshot-10.png, screenshot-11.png, 
> screenshot-12.png, screenshot-13.png, screenshot-14.png, screenshot-2.png, 
> screenshot-3.png, screenshot-4.png, screenshot-5.png, screenshot-6.png, 
> screenshot-7.png, screenshot-8.png, screenshot-9.png
>
>
> *What's the problem ?*
> There are 3 datanodes in a group: leader, follower1, follower2.
> 1. follower2 report close pipeline
> 2. scm send close pipeline command
> 3. leader and follower1 remove group, but follower2 socket timeout and does 
> not remove group
> 4.  follower2 then begin infinite LeaderElection at least 6 hours, leader and 
> follower1 response group not found
> You can see find it in following screenshot.
> 1. follower2 report close pipeline
>  !screenshot-8.png! 
> 2. Scm close pipeline:
>  !screenshot-9.png! 
>  !screenshot-10.png! 
> 3. leader remove group
>  !screenshot-11.png! 
>    follower1 remove group
>  !screenshot-12.png! 
>  follower2 socket timeout
>  !screenshot-13.png! 
> 4. follower2 then begin infinite LeaderElection at least 6 hours
>  !screenshot-14.png! 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to