[
https://issues.apache.org/jira/browse/PHOENIX-4239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16187250#comment-16187250
]
James Taylor commented on PHOENIX-4239:
---------------------------------------
[~samarthjain] - IMHO, it's more of a test issue - we're running the rebuilder
way more frequently than would otherwise be run in these tests. Looks like the
exception is due to a region opening. Perhaps we should retry only in that
case? WDYT, [~rajeshbabu]? How will the online region test typically perform on
a real cluster? Will we get false positives?
{code}
2017-09-30 04:20:49,400 DEBUG
[RpcServer.FifoWFPBQ.priority.handler=1,queue=1,port=41061]
org.apache.hadoop.hbase.ipc.CallRunner(126):
RpcServer.FifoWFPBQ.priority.handler=1,queue=1,port=41061: callId: 4 service:
AdminService methodName: GetRegionInfo size: 96 connection: 67.195.81.136:49226
org.apache.hadoop.hbase.exceptions.RegionOpeningException: Region
T000013.T000015,,1506745239398.8fc67b2271d350dd278b0a1b8e458bd8. is opening on
asf916.gq1.ygridcore.net,41061,1506745136460
at
org.apache.hadoop.hbase.regionserver.HRegionServer.getRegionByEncodedName(HRegionServer.java:2972)
at
org.apache.hadoop.hbase.regionserver.RSRpcServices.getRegion(RSRpcServices.java:1140)
at
org.apache.hadoop.hbase.regionserver.RSRpcServices.getRegionInfo(RSRpcServices.java:1424)
at
org.apache.hadoop.hbase.protobuf.generated.AdminProtos$AdminService$2.callBlockingMethod(AdminProtos.java:22731)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2339)
at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:123)
at
org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:188)
at
org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:168)
2017-09-30 04:20:49,402 DEBUG [main] org.apache.phoenix.util.MetaDataUtil(550):
Region 8fc67b2271d350dd278b0a1b8e458bd8 isn't online due
to:org.apache.hadoop.hbase.exceptions.RegionOpeningException:
org.apache.hadoop.hbase.exceptions.RegionOpeningException: Region
T000013.T000015,,1506745239398.8fc67b2271d350dd278b0a1b8e458bd8. is opening on
asf916.gq1.ygridcore.net,41061,1506745136460
at
org.apache.hadoop.hbase.regionserver.HRegionServer.getRegionByEncodedName(HRegionServer.java:2972)
at
org.apache.hadoop.hbase.regionserver.RSRpcServices.getRegion(RSRpcServices.java:1140)
at
org.apache.hadoop.hbase.regionserver.RSRpcServices.getRegionInfo(RSRpcServices.java:1424)
at
org.apache.hadoop.hbase.protobuf.generated.AdminProtos$AdminService$2.callBlockingMethod(AdminProtos.java:22731)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2339)
at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:123)
at
org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:188)
at
org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:168)
{code}
FYI, looks like my change stopped the flapping.
> Fix flapping test in PartialIndexRebuilderIT
> --------------------------------------------
>
> Key: PHOENIX-4239
> URL: https://issues.apache.org/jira/browse/PHOENIX-4239
> Project: Phoenix
> Issue Type: Test
> Reporter: James Taylor
> Assignee: James Taylor
> Fix For: 4.12.0
>
> Attachments: PHOENIX-4239.patch, PHOENIX-4239_v2.patch,
> PHOENIX-4239_v3.patch, PHOENIX-4239_v4.patch, PHOENIX-4239_v5.patch
>
>
> To get more info on this flapper:
> https://www.google.com/url?q=https%3A%2F%2Fbuilds.apache.org%2Fjob%2FPhoenix-master%2F1810%2FtestReport%2Fjunit%2Forg.apache.phoenix.end2end.index%2FPartialIndexRebuilderIT%2FtestIndexWriteFailureLeavingIndexActive%2F&sa=D&sntz=1&usg=AFQjCNEj0LexiK8bm4GzGex9JUvu0DBJag
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)