[ https://issues.apache.org/jira/browse/HBASE-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12663585#action_12663585 ]
Andrew Purtell commented on HBASE-1124:
---------------------------------------
The lease timeouts got me wondering so I ran Wireshark and looked over some
packet traces. The lease timeouts are legit. Can't blame master if HRS are not
contacting it in time.
HRPC requires prompt name resolution when (re)establishing connections for IPC.
This affects all aspects of system operation: HBase heartbeats, DFS block
shuffling and replication, etc. When DNS resolver latency increases, HDFS and
HBase become unstable.
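
One way to check whether name resolution is slow from a given node is to time
lookups directly. The following is a minimal sketch, not part of HBase or
Hadoop: the function names and the 0.5 second threshold are assumptions chosen
for illustration, and the hostnames you pass in would be your own master and
region server names.

```python
import socket
import time


def measure_resolution(hostname, resolver=socket.gethostbyname, clock=time.monotonic):
    """Return (elapsed_seconds, address_or_None) for one lookup of hostname."""
    start = clock()
    try:
        address = resolver(hostname)
    except OSError:
        # socket.gaierror and socket.herror are subclasses of OSError;
        # a failed lookup is reported as "no address".
        address = None
    return clock() - start, address


def slow_lookups(hostnames, threshold_seconds=0.5, resolver=socket.gethostbyname):
    """Return (hostname, elapsed) pairs for lookups that failed or were slow.

    threshold_seconds is an arbitrary illustrative value, not an HBase setting.
    """
    slow = []
    for name in hostnames:
        elapsed, address = measure_resolution(name, resolver=resolver)
        if address is None or elapsed > threshold_seconds:
            slow.append((name, elapsed))
    return slow
```

Calling slow_lookups() with the cluster's hostnames at intervals, from each
node, would show whether resolver latency spikes line up with the lease
timeouts. The resolver argument is injectable so the logic can be exercised
without touching the network.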
Root cause might just be overloaded DNS servers -- BIND cache too large,
swapping. Taking steps now, will monitor to see what happens.
> Balancer kicks in way too early
> -------------------------------
>
> Key: HBASE-1124
> URL: https://issues.apache.org/jira/browse/HBASE-1124
> Project: Hadoop HBase
> Issue Type: Bug
> Reporter: Andrew Purtell
> Fix For: 0.19.0
>
>
> Balancer kicks in before all regions are assigned out. Causes confusion.
> Master won't accept OPENs from "overloaded" HRS. Master is slow to respond to
> the UI and HRS while this is going on. Master sometimes takes too long to
> respond to a HRS heartbeat, so the HRS will reinit. This causes more confusion.