[
https://issues.apache.org/jira/browse/HBASE-13405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395212#comment-14395212
]
Mikhail Antonov commented on HBASE-13405:
-----------------------------------------
In fact I can see very similar failures in other test cases in this class, e.g.
{code}
2015-04-03 15:48:31,330 WARN [Thread-5524] util.HBaseFsck(1759): Could not
process regionserver 10.1.4.219:50028
org.apache.hadoop.hbase.ipc.CallTimeoutException: callId: 7 methodName:
GetOnlineRegion param {TODO: class
org.apache.hadoop.hbase.protobuf.generated.AdminProtos$GetOnlineRegionRequest}
at
org.apache.hadoop.hbase.ipc.AsyncRpcClient.call(AsyncRpcClient.java:235)
at
org.apache.hadoop.hbase.ipc.AbstractRpcClient.callBlockingMethod(AbstractRpcClient.java:213)
at
org.apache.hadoop.hbase.ipc.AbstractRpcClient$BlockingRpcChannelImplementation.callBlockingMethod(AbstractRpcClient.java:287)
at
org.apache.hadoop.hbase.protobuf.generated.AdminProtos$AdminService$BlockingStub.getOnlineRegion(AdminProtos.java:22869)
at
org.apache.hadoop.hbase.protobuf.ProtobufUtil.getOnlineRegions(ProtobufUtil.java:1784)
at
org.apache.hadoop.hbase.util.HBaseFsck$WorkItemRegion.call(HBaseFsck.java:3895)
at
org.apache.hadoop.hbase.util.HBaseFsck$WorkItemRegion.call(HBaseFsck.java:3874)
at java.util.concurrent.FutureTask.run(FutureTask.java:262)
at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
at java.util.concurrent.FutureTask.run(FutureTask.java:262)
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:178)
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:292)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:744)
2015-04-03 15:48:29,727 DEBUG [RS_CLOSE_META-10.1.4.219:50028-0]
regionserver.HRegionFileSystem(381): Committing store file
hdfs://localhost:50002/user/mantonov/test-data/539995fc-b1fe-452f-9f50-42d07acc78ac/data/hbase/meta/1588230740/.tmp/e468fb98da9241e2aeaf47d0b92e2cc7
as
hdfs://localhost:50002/user/mantonov/test-data/539995fc-b1fe-452f-9f50-42d07acc78ac/data/hbase/meta/1588230740/table/e468fb98da9241e2aeaf47d0b92e2cc7
2015-04-03 15:48:29,726 INFO
[RpcServer.reader=1,bindAddress=10.1.4.219,port=50028]
ipc.RpcServer$Connection(1640): Connection from 10.1.4.219 port: 53374 with
version info: version: "2.0.0-SNAPSHOT" url:
"git://Mikhails-MacBook-Pro.local/wandisco/source/hbase" revision:
"d8b10656d00779e194c3caca118995136babce99" user: "mantonov" date: "Fri Apr 3
15:22:53 PDT 2015" src_checksum: "e261fe7369b6136e325d351242d32004"
java.lang.AssertionError:
Expected :[UNKNOWN, NO_META_REGION, NULL_META_REGION]
Actual :[UNKNOWN, NO_META_REGION, NULL_META_REGION, RS_CONNECT_FAILURE,
RS_CONNECT_FAILURE]
<Click to see difference>
at org.junit.Assert.fail(Assert.java:88)
at org.junit.Assert.failNotEquals(Assert.java:743)
at org.junit.Assert.assertEquals(Assert.java:118)
at org.junit.Assert.assertEquals(Assert.java:144)
at
org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.assertErrors(HbckTestingUtil.java:100)
at
org.apache.hadoop.hbase.util.TestHBaseFsck.testFixAssignmentsWhenMETAinTransition(TestHBaseFsck.java:278)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
at
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
at
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
at
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
at
org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
{code}
> TestHBaseFsck is flaky
> ----------------------
>
> Key: HBASE-13405
> URL: https://issues.apache.org/jira/browse/HBASE-13405
> Project: HBase
> Issue Type: Bug
> Components: test
> Affects Versions: 2.0.0
> Reporter: Mikhail Antonov
>
> Once in a while I'm seeing the following, running #testContainedRegionOverlap
> test in IDE after clean install (mac osx, hbase master):
> {code}
> regionserver.HRegionServer(1863): Post open deploy tasks for
> tableContainedRegionOverlap,A,1428099123733.03a139b02119e99ef08149addd9a7996.
> 2015-04-03 15:12:11,695 INFO
> [PostOpenDeployTasks:03a139b02119e99ef08149addd9a7996]
> regionserver.HRegionServer(1956): Failed to report region transition, will
> retry
> java.io.InterruptedIOException: Origin: InterruptedException
> at
> org.apache.hadoop.hbase.util.ExceptionUtil.asInterrupt(ExceptionUtil.java:65)
> at
> org.apache.hadoop.hbase.protobuf.ProtobufUtil.getRemoteException(ProtobufUtil.java:313)
> at
> org.apache.hadoop.hbase.regionserver.HRegionServer.reportRegionStateTransition(HRegionServer.java:1955)
> at
> org.apache.hadoop.hbase.regionserver.HRegionServer.postOpenDeployTasks(HRegionServer.java:1882)
> at
> org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler$PostOpenDeployTasksThread.run(OpenRegionHandler.java:241)
> Caused by: java.lang.InterruptedException: callId: 158 methodName:
> ReportRegionStateTransition param {TODO: class
> org.apache.hadoop.hbase.protobuf.generated.RegionServerStatusProtos$ReportRegionStateTransitionRequest}
> at
> io.netty.util.concurrent.DefaultPromise.await0(DefaultPromise.java:333)
> at
> io.netty.util.concurrent.DefaultPromise.await(DefaultPromise.java:266)
> at io.netty.util.concurrent.AbstractFuture.get(AbstractFuture.java:42)
> at
> org.apache.hadoop.hbase.ipc.AsyncRpcClient.call(AsyncRpcClient.java:226)
> at
> org.apache.hadoop.hbase.ipc.AbstractRpcClient.callBlockingMethod(AbstractRpcClient.java:213)
> at
> org.apache.hadoop.hbase.ipc.AbstractRpcClient$BlockingRpcChannelImplementation.callBlockingMethod(AbstractRpcClient.java:287)
> at
> org.apache.hadoop.hbase.protobuf.generated.RegionServerStatusProtos$RegionServerStatusService$BlockingStub.reportRegionStateTransition(RegionServerStatusProtos.java:9030)
> at
> org.apache.hadoop.hbase.regionserver.HRegionServer.reportRegionStateTransition(HRegionServer.java:1946)
> ... 2 more
> 2015-04-03 15:12:11,696 INFO
> [B.defaultRpcServer.handler=1,queue=0,port=51217]
> master.MasterRpcServices(237): Client=mantonov//10.1.4.219 set
> balanceSwitch=false
> 2015-04-03 15:12:11,696 DEBUG [main-EventThread]
> zookeeper.ZooKeeperWatcher(388): maste
> {code}
> and then:
> {code}
> 015-04-03 15:12:11,796 INFO [Thread-3019] client.HBaseAdmin$10(981): Started
> disable of tableContainedRegionOverlap
> 2015-04-03 15:12:21,641 INFO
> [B.defaultRpcServer.handler=1,queue=0,port=51217] master.HMaster(1645):
> Client=mantonov//10.1.4.219 disable tableContainedRegionOverlap
> java.lang.AssertionError:
> Expected :[]
> Actual :[NOT_DEPLOYED, HOLE_IN_REGION_CHAIN]
> <Click to see difference>
> at org.junit.Assert.fail(Assert.java:88)
> at org.junit.Assert.failNotEquals(Assert.java:743)
> at org.junit.Assert.assertEquals(Assert.java:118)
> at org.junit.Assert.assertEquals(Assert.java:144)
> at
> org.apache.hadoop.hbase.util.hbck.HbckTestingUtil.assertNoErrors(HbckTestingUtil.java:92)
> at
> org.apache.hadoop.hbase.util.TestHBaseFsck.testContainedRegionOverlap(TestHBaseFsck.java:941)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
> at
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
> at
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
> at
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
> at
> org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)