[ 
https://issues.apache.org/jira/browse/HBASE-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12875269#action_12875269
 ] 

stack commented on HBASE-2614:
------------------------------

I see  (Too many open files) in this log.

A few other things of interest are that the Master won't go down because it 
thinks there is still a regionserver alive.  Its stuck here.
{code}
Thread 144 (RegionManager.rootScanner):
  State: TIMED_WAITING
  Blocked count: 63
  Waited count: 333
  Stack:
    java.lang.Object.wait(Native Method)
    
org.apache.hadoop.hbase.master.RegionManager.waitForRootRegionLocation(RegionManager.java:1161)
    org.apache.hadoop.hbase.master.RootScanner.scanRoot(RootScanner.java:45)
    
org.apache.hadoop.hbase.master.RootScanner.maintenanceScan(RootScanner.java:79)
    org.apache.hadoop.hbase.master.BaseScanner.chore(BaseScanner.java:154)
    org.apache.hadoop.hbase.Chore.run(Chore.java:68)
{code}

Other issue is that in our minihbasecluster, when we are asked to start a new 
server, we'll wait till its online only here, the new regionserver crashed on 
startup.  Need to insert isAlive check into loop waiting on new server to come 
online.   Patch coming.

> killing server in TestMasterTransitions causes NPEs and test deadlock
> ---------------------------------------------------------------------
>
>                 Key: HBASE-2614
>                 URL: https://issues.apache.org/jira/browse/HBASE-2614
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Andrew Purtell
>             Fix For: 0.21.0
>
>         Attachments: 
> org.apache.hadoop.hbase.master.TestMasterTransitions-output.txt.gz
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to