Hello,

I have three questions related to HBase major compactions and data locality:

1) During a scheduled maintenance event to add 2 new regionservers to the
HBase cluster, Ambari reported that a restart of all HDFS nodes (both
NameNodes and DataNodes) was required. From the logs, it looks like the
HBase balancer became active after the two new nodes registered.
Is it normal to have to restart all HDFS nodes to add a new node to the
cluster? I am using HDP 2.4.

2) Should I turn off the HBase balancer before adding new nodes? If so,
when should I turn it back on, and what would the impact be? Would it cause
a large drop in locality again?

3) When all the nodes in the cluster were restarted through Ambari, locality
dropped to ~13% and HBase was almost unresponsive. Only triggering a
manual major compaction seems to improve locality after this, but the
increase is very gradual (about 4 percentage points per hour). Is there
any way to speed up major compaction (for example, by increasing the number
of compaction threads) in the HDP distribution?
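
As a rough sense of scale for the recovery time, here is a back-of-the-envelope
sketch. It assumes the increase stays linear at about 4 percentage points per
hour, and it uses 90% as a hypothetical target; both are assumptions for
illustration, not measurements. The function name is made up for this example.

```python
def hours_to_target(current_pct, target_pct, rate_pct_per_hour):
    """Estimate hours for locality to reach target_pct, assuming a constant rate."""
    if rate_pct_per_hour <= 0:
        raise ValueError("rate must be positive")
    # If locality is already at or above the target, no time is needed.
    remaining = max(0.0, target_pct - current_pct)
    return remaining / rate_pct_per_hour


# Observed after the restart: ~13% locality, rising ~4 points per hour.
# The 90% target is an assumed value, not taken from the cluster.
estimate = hours_to_target(13.0, 90.0, 4.0)
print(f"{estimate:.2f} hours")
```

In practice the rate is unlikely to stay constant, so this is only meant to
show why the current pace is a concern.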


Thanks,
Ganesh
