Todd Lipcon created KUDU-1407:
---------------------------------

             Summary: Leader should not evict a follower when the follower is 
in the process of starting up a tablet
                 Key: KUDU-1407
                 URL: https://issues.apache.org/jira/browse/KUDU-1407
             Project: Kudu
          Issue Type: Bug
          Components: consensus
    Affects Versions: 0.8.0
            Reporter: Todd Lipcon
            Assignee: Todd Lipcon
            Priority: Critical


It seems like, if the leader gets an error from one of its followers because 
the tablet is not running, it considers this replica to be 'unresponsive'. If 
this happens for 5 minutes, it will evict that follower to try to create a new 
replica.

This can cause problems at cluster startup time when there is a lot of data and 
a cold disk cache - the startup bootstrap process might be more than five 
minutes and leaders might end up evicting followers that are perfectly healthy 
(just in the process of coming up).




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to