Rushabh Shah created HBASE-28293:
------------------------------------
Summary: Add metric for GetClusterStatus request count.
Key: HBASE-28293
URL: https://issues.apache.org/jira/browse/HBASE-28293
Project: HBase
Issue Type: Bug
Reporter: Rushabh Shah
We have been bitten multiple times by GetClusterStatus request overwhelming
HMaster's memory usage. It would be good to add a metric for the total
GetClusterStatus requests count.
In almost all of our production incidents involving GetClusterStatus request,
HMaster were running out of memory with many clients call this RPC in parallel
and the response size is very big.
In hbase2 we have
[ClusterMetrics.Option|https://github.com/apache/hbase/blob/master/hbase-client/src/main/java/org/apache/hadoop/hbase/ClusterMetrics.java#L164-L224]
which can reduce the size of the response.
It would be nice to add another metric to indicate if the response size of
GetClusterStatus is greater than some threshold (like 5MB)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)