Hello Kudu developers!

There seems to be a decent amount of interest in a few features relating to the controlled downtime of tablet servers. To name a few that I've seen gathering interest, there are:
- Tablet server maintenance mode (KUDU-2069)
- Tablet server replica draining / tablet server decommissioning (KUDU-1827, KUDU-2914)
- Unregistering a tablet server from the master (KUDU-2915)
- Cluster rolling restart (KUDU-2054)
- Automatic cluster rebalancing (KUDU-2780)

So, I wanted to start a central document to organize my thoughts on each of these, in hopes that it might steer these features in a more unified direction. Please take a look, if you're interested. I'm open to feedback and discussion.

https://docs.google.com/document/d/12BZqspGjHvQlc-o8XTDixoRol9Q36WJzXLJ6p15Zhf0/edit?usp=sharing

Thanks!
Andrew Wong
