[
https://issues.apache.org/jira/browse/HBASE-3331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13090942#comment-13090942
]
Sudharsan Sampath commented on HBASE-3331:
------------------------------------------
Its more related to the META region only. Followimg debug info is printed
before throwing exception
2011-08-25 12:46:52,443 DEBUG
org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation:
locateRegionInMeta parentTable=-ROOT-, metaLocation=address: sb6270x1664:60020,
regioninfo: -ROOT-,,0.70236052, attempt=0 of 10 failed; retrying after sleep of
1000 because: Connection refused
> Kill -STOP of RS hosting META does not recover
> ----------------------------------------------
>
> Key: HBASE-3331
> URL: https://issues.apache.org/jira/browse/HBASE-3331
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.0
> Reporter: Todd Lipcon
> Priority: Critical
> Fix For: 0.92.0
>
> Attachments: timeouts.log.txt
>
>
> If you find the server hosting META and kill -STOP its region server, it will
> eventually lose its ZK session and the master will split its logs and try to
> reassign. However, at some point along here it tries to access the old META,
> and gets SocketTimeoutExceptions, which cause it to keep retrying forever.
> Once I kill -9ed the stopped server, things came back to life.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira