In that case, no leader will be elected until at least 1 replica in ISR comes back.
Thanks, Jun On Wed, Nov 19, 2014 at 6:58 PM, Jason Rosenberg <j...@squareup.com> wrote: > What if it never comes back with unclean leader election disabled (but > another broker does come back)? > > On Wed, Nov 19, 2014 at 9:32 PM, Jun Rao <jun...@gmail.com> wrote: > > > In that case, we just wait until the broker in ISR is back and make it > the > > leader and take whatever data is has. > > > > Thanks, > > > > Jun > > > > On Tue, Nov 18, 2014 at 10:36 PM, Jason Rosenberg <j...@squareup.com> > > wrote: > > > > > Ok, > > > > > > Makes sense. But if the node is not actually healthy (and underwent a > > hard > > > crash) it would likely not be able to avoid an 'unclean' > restart.....what > > > happens if unclean leader election is disabled, but there are no > 'clean' > > > partitions available? > > > > > > Jason > > > > > > On Wed, Nov 19, 2014 at 12:40 AM, Jun Rao <jun...@gmail.com> wrote: > > > > > > > Yes, we will preserve the last replica in ISR. This way, we know > which > > > > replica has all committed messages and can wait for it to come back > as > > > the > > > > leader, if unclean leader election is disabled. > > > > > > > > Thanks, > > > > > > > > Jun > > > > > > > > On Mon, Nov 17, 2014 at 11:06 AM, Jason Rosenberg <j...@squareup.com> > > > > wrote: > > > > > > > > > We have had 2 nodes in a 4 node cluster die this weekend, sadly. > > > > > Fortunately there was no critical data on these machines yet. > > > > > > > > > > The cluster is running 0.8.1.1, and using replication factor of 2 > > for 2 > > > > > topics, each with 20 partitions. > > > > > > > > > > For sake of discussion, assume that nodes A and B are still up, > and C > > > > and D > > > > > are now down. > > > > > > > > > > As expected, partitions that had one replica on a good host (A or > B) > > > and > > > > > one on a bad node (C or D), had their ISR shrink to just 1 node (A > or > > > B). > > > > > > > > > > Roughly 1/6 of the partitions had their 2 replicas on the 2 bad > > nodes, > > > C > > > > > and D. For these, I was expecting the ISR to show up as empty, and > > the > > > > > partition unavailable. > > > > > > > > > > However, that's not what I'm seeing. When running TopicCommand > > > > --describe, > > > > > I see that the ISR still shows 1 replica, on node D (D was the > second > > > > node > > > > > to go down). > > > > > > > > > > And, producers are still periodically trying to produce to node D > > (but > > > > > failing and retrying to one of the good nodes). > > > > > > > > > > So, it seems the cluster's meta data is still thinking that node D > is > > > up > > > > > and serving the partitions that were only replicated on C and D. > > > > However, > > > > > for partitions that were on A and D, or B and D, D is not shown as > > > being > > > > in > > > > > the ISR. > > > > > > > > > > Is this correct? Should the cluster continue showing the last node > > to > > > > have > > > > > been alive for a partition as still in the ISR? > > > > > > > > > > Jason > > > > > > > > > > > > > > >