Jean-Daniel Cryans created KUDU-1959:
----------------------------------------
Summary: Hard to tell when a cluster is done starting up
Key: KUDU-1959
URL: https://issues.apache.org/jira/browse/KUDU-1959
Project: Kudu
Issue Type: Improvement
Reporter: Jean-Daniel Cryans
Restarting a cluster that has a good amount of data, it's hard to tell when
it's "done". Right now the things I do:
- Run ksck, wait until most tablets are not in "unavailable" or "boostrapping"
state.
- Watch the metrics and see when the data under management is close to where
it was before restarting (it grows as tablets are getting bootstrapped).
- Look at the tablet server web UIs for tablets, compare how many are done
bootstrapping VS in the process of VS not started.
Ideas on how to improve this:
- In the master's web UI for tablet servers, show how many tablets are running
VS not running (I wouldn't add anything about tombstoned tablets)
- Add metrics for tablets in different states.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)