[ https://issues.apache.org/jira/browse/HBASE-7686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13603805#comment-13603805 ]
Ted Yu commented on HBASE-7686:
-------------------------------
I noticed a stack trace similar to the following when I opened HBASE-8116:
https://builds.apache.org/view/G-L/view/HBase/job/HBase-TRUNK/3961/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testShutdownFixupWhenDaughterHasSplit/
{code}
Potentially hanging thread:
juno.apache.org,41160,1363344239519-daughterOpener=20e42fffd4bdbbaf15822dfd60c58371
java.lang.Object.wait(Native Method)
java.lang.Object.wait(Object.java:503)
org.apache.hadoop.ipc.Client.call(Client.java:1093)
org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:229)
$Proxy10.delete(Unknown Source)
sun.reflect.GeneratedMethodAccessor28.invoke(Unknown Source)
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
java.lang.reflect.Method.invoke(Method.java:601)
org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:85)
org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:62)
$Proxy10.delete(Unknown Source)
sun.reflect.GeneratedMethodAccessor28.invoke(Unknown Source)
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
java.lang.reflect.Method.invoke(Method.java:601)
org.apache.hadoop.hbase.fs.HFileSystem$1.invoke(HFileSystem.java:267)
$Proxy19.delete(Unknown Source)
org.apache.hadoop.hdfs.DFSClient.delete(DFSClient.java:981)
org.apache.hadoop.hdfs.DistributedFileSystem.delete(DistributedFileSystem.java:245)
org.apache.hadoop.fs.FilterFileSystem.delete(FilterFileSystem.java:154)
org.apache.hadoop.hbase.util.FSUtils.deleteDirectory(FSUtils.java:166)
org.apache.hadoop.hbase.regionserver.HRegionFileSystem.cleanupTempDir(HRegionFileSystem.java:119)
org.apache.hadoop.hbase.regionserver.HRegion.initializeRegionInternals(HRegion.java:571)
org.apache.hadoop.hbase.regionserver.HRegion.initialize(HRegion.java:546)
org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:4041)
org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughterRegion(SplitTransaction.java:520)
org.apache.hadoop.hbase.regionserver.SplitTransaction$DaughterOpener.run(SplitTransaction.java:501)
...
Potentially hanging thread:
juno.apache.org,41160,1363344239519-daughterOpener=58f565d35c886dce2bcf997cf247611b
java.lang.Object.wait(Native Method)
java.lang.Object.wait(Object.java:503)
org.apache.hadoop.ipc.Client.call(Client.java:1093)
org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:229)
$Proxy10.delete(Unknown Source)
sun.reflect.GeneratedMethodAccessor28.invoke(Unknown Source)
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
java.lang.reflect.Method.invoke(Method.java:601)
org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:85)
org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:62)
$Proxy10.delete(Unknown Source)
sun.reflect.GeneratedMethodAccessor28.invoke(Unknown Source)
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
java.lang.reflect.Method.invoke(Method.java:601)
org.apache.hadoop.hbase.fs.HFileSystem$1.invoke(HFileSystem.java:267)
$Proxy19.delete(Unknown Source)
org.apache.hadoop.hdfs.DFSClient.delete(DFSClient.java:981)
org.apache.hadoop.hdfs.DistributedFileSystem.delete(DistributedFileSystem.java:245)
org.apache.hadoop.fs.FilterFileSystem.delete(FilterFileSystem.java:154)
org.apache.hadoop.hbase.util.FSUtils.deleteDirectory(FSUtils.java:166)
org.apache.hadoop.hbase.regionserver.HRegionFileSystem.cleanupTempDir(HRegionFileSystem.java:119)
org.apache.hadoop.hbase.regionserver.HRegion.initializeRegionInternals(HRegion.java:571)
org.apache.hadoop.hbase.regionserver.HRegion.initialize(HRegion.java:546)
org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:4041)
org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughterRegion(SplitTransaction.java:520)
org.apache.hadoop.hbase.regionserver.SplitTransaction$DaughterOpener.run(SplitTransaction.java:501)
{code}
> TestSplitTransactionOnCluster fails occasionally in trunk builds
> ----------------------------------------------------------------
>
> Key: HBASE-7686
> URL: https://issues.apache.org/jira/browse/HBASE-7686
> Project: HBase
> Issue Type: Bug
> Reporter: Ted Yu
> Priority: Critical
> Fix For: 0.95.0
>
> Attachments: HBASE-7686-v0.patch, HBASE-7686-v1.patch
>
>
> From trunk build #3808:
> {code}
> testShouldFailSplitIfZNodeDoesNotExistDueToPrevRollBack(org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster):
> test timed out after 20000 milliseconds
>
> testMasterRestartWhenSplittingIsPartial(org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster):
> test timed out after 300000 milliseconds
>
> testExistingZnodeBlocksSplitAndWeRollback(org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster):
> test timed out after 300000 milliseconds
> {code}
> From HBase-TRUNK-on-Hadoop-2.0.0 #378:
> {code}
> testShutdownSimpleFixup(org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster):
> Region not moved off .META. server
>
> testShouldFailSplitIfZNodeDoesNotExistDueToPrevRollBack(org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster):
> test timed out after 20000 milliseconds
> {code}