[ https://issues.apache.org/jira/browse/HBASE-11747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16655778#comment-16655778 ]
stack commented on HBASE-11747:
-------------------------------

[~apurtell] Thanks for chiming in. Agree. This is the RS heartbeating the Master. I took a look at the Master doing bulk assignment on startup, and all it is doing is parsing the heartbeat report. This needs some loving. A long time ago, our Elliott suggested keeping state in a table: HBASE-7057. There may be some overlap with trying to keep stats/heatmap... but at the same time, volunteering 100MB to the Master every second x 100s of RS is off. It needs to go on a diet. It needs to be pull vs push, and so on.

> ClusterStatus (heartbeat) is too bulky
> ---------------------------------------
>
>                 Key: HBASE-11747
>                 URL: https://issues.apache.org/jira/browse/HBASE-11747
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Virag Kothari
>            Priority: Critical
>         Attachments: exceptiontrace
>
>
> Following exception on 0.98 with 1M regions on cluster with 160 region servers
> {code}
> Caused by: java.io.IOException: Call to regionserverhost:port failed on local exception: com.google.protobuf.InvalidProtocolBufferException: Protocol message was too large. May be malicious. Use CodedInputStream.setSizeLimit() to increase the size limit.
>     at org.apache.hadoop.hbase.ipc.RpcClient.wrapException(RpcClient.java:1482)
>     at org.apache.hadoop.hbase.ipc.RpcClient.call(RpcClient.java:1454)
>     at org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1654)
>     at org.apache.hadoop.hbase.ipc.RpcClient$BlockingRpcChannelImplementation.callBlockingMethod(RpcClient.java:1712)
>     at org.apache.hadoop.hbase.protobuf.generated.MasterProtos$MasterService$BlockingStub.getClusterStatus(MasterProtos.java:42555)
>     at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$5.getClusterStatus(HConnectionManager.java:2132)
>     at org.apache.hadoop.hbase.client.HBaseAdmin$16.call(HBaseAdmin.java:2166)
>     at org.apache.hadoop.hbase.client.HBaseAdmin$16.call(HBaseAdmin.java:2162)
>     at org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:114)
>     ... 43 more
> Caused by: com.google.protobuf.InvalidProtocolBufferException: Protocol message was too large. May be malicious. Use CodedInputStream.setSizeLimit() to increase the size limit.
>     at com.google.protobuf.InvalidProtocolBufferException.sizeLimitExceeded(InvalidProtocolBufferException.java:110)
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)