[
https://issues.apache.org/jira/browse/HBASE-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12875269#action_12875269
]
stack commented on HBASE-2614:
------------------------------
I see (Too many open files) in this log.
A few other things of interest are that the Master won't go down because it
thinks there is still a regionserver alive. Its stuck here.
{code}
Thread 144 (RegionManager.rootScanner):
State: TIMED_WAITING
Blocked count: 63
Waited count: 333
Stack:
java.lang.Object.wait(Native Method)
org.apache.hadoop.hbase.master.RegionManager.waitForRootRegionLocation(RegionManager.java:1161)
org.apache.hadoop.hbase.master.RootScanner.scanRoot(RootScanner.java:45)
org.apache.hadoop.hbase.master.RootScanner.maintenanceScan(RootScanner.java:79)
org.apache.hadoop.hbase.master.BaseScanner.chore(BaseScanner.java:154)
org.apache.hadoop.hbase.Chore.run(Chore.java:68)
{code}
Other issue is that in our minihbasecluster, when we are asked to start a new
server, we'll wait till its online only here, the new regionserver crashed on
startup. Need to insert isAlive check into loop waiting on new server to come
online. Patch coming.
> killing server in TestMasterTransitions causes NPEs and test deadlock
> ---------------------------------------------------------------------
>
> Key: HBASE-2614
> URL: https://issues.apache.org/jira/browse/HBASE-2614
> Project: HBase
> Issue Type: Bug
> Reporter: Andrew Purtell
> Fix For: 0.21.0
>
> Attachments:
> org.apache.hadoop.hbase.master.TestMasterTransitions-output.txt.gz
>
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.