[
https://issues.apache.org/jira/browse/HBASE-9254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13748258#comment-13748258
]
Ted Yu commented on HBASE-9254:
-------------------------------
Thanks for taking a look, Jon.
The following snippet of stack trace might be related:
{code}
"NamespaceJanitor-juno:37014" daemon prio=10 tid=0x75085800 nid=0x3471 waiting
for monitor entry [0x6d1ad000]
java.lang.Thread.State: BLOCKED (on object monitor)
at
org.apache.hadoop.hbase.master.TableNamespaceManager.list(TableNamespaceManager.java:183)
- waiting to lock <0x7fb83a10> (a
org.apache.hadoop.hbase.master.TableNamespaceManager)
at
org.apache.hadoop.hbase.master.HMaster.listNamespaceDescriptors(HMaster.java:3120)
at
org.apache.hadoop.hbase.master.NamespaceJanitor.removeOrphans(NamespaceJanitor.java:103)
at
org.apache.hadoop.hbase.master.NamespaceJanitor.chore(NamespaceJanitor.java:87)
{code}
Since list() and TableNamespaceManager.get() are synchronized methods, their
interaction might have made the test run longer than expected.
bq. can you make that a sub-issue/workaround
Will do.
bq. keep this open until you find the root cause?
Sure.
> TestHBaseFsck occasionally hung
> -------------------------------
>
> Key: HBASE-9254
> URL: https://issues.apache.org/jira/browse/HBASE-9254
> Project: HBase
> Issue Type: Test
> Affects Versions: 0.95.2
> Reporter: Ted Yu
> Assignee: Ted Yu
> Attachments: 9254-v1.txt
>
>
> From https://builds.apache.org/job/hbase-0.95-on-hadoop2/247/console :
> {code}
> "pool-1-thread-1" prio=10 tid=0x73a2a400 nid=0x2f4d in Object.wait()
> [0x73bdd000]
> java.lang.Thread.State: TIMED_WAITING (on object monitor)
> at java.lang.Object.wait(Native Method)
> at org.apache.hadoop.hbase.ipc.RpcClient.call(RpcClient.java:1412)
> - locked <0xccdd8898> (a org.apache.hadoop.hbase.ipc.RpcClient$Call)
> at
> org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1630)
> at
> org.apache.hadoop.hbase.ipc.RpcClient$BlockingRpcChannelImplementation.callBlockingMethod(RpcClient.java:1687)
> at
> org.apache.hadoop.hbase.protobuf.generated.MasterAdminProtos$MasterAdminService$BlockingStub.createTable(MasterAdminProtos.java:29365)
> at
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$5.createTable(HConnectionManager.java:1996)
> at org.apache.hadoop.hbase.client.HBaseAdmin$2.call(HBaseAdmin.java:590)
> at org.apache.hadoop.hbase.client.HBaseAdmin$2.call(HBaseAdmin.java:586)
> at
> org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:120)
> - locked <0x81c8abb0> (a
> org.apache.hadoop.hbase.client.RpcRetryingCaller)
> at
> org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:98)
> - locked <0x81c8abb0> (a
> org.apache.hadoop.hbase.client.RpcRetryingCaller)
> at
> org.apache.hadoop.hbase.client.HBaseAdmin.executeCallable(HBaseAdmin.java:3087)
> at
> org.apache.hadoop.hbase.client.HBaseAdmin.createTableAsync(HBaseAdmin.java:586)
> at
> org.apache.hadoop.hbase.client.HBaseAdmin.createTable(HBaseAdmin.java:477)
> at
> org.apache.hadoop.hbase.util.TestHBaseFsck.setupTable(TestHBaseFsck.java:338)
> at
> org.apache.hadoop.hbase.util.TestHBaseFsck.testSplitDaughtersNotInMeta(TestHBaseFsck.java:1362)
> ...
> {color:red}-1 core zombie tests{color}. There are 1 zombie test(s):
> at
> org.apache.hadoop.hbase.util.TestHBaseFsck.testSplitDaughtersNotInMeta(TestHBaseFsck.java:1362)'
> {code}
> I looked at
> https://builds.apache.org/job/hbase-0.95-on-hadoop2/247/artifact/0.95-on-hadoop2/hbase-server/target/surefire-reports/,
> there was no test output.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira