[
https://issues.apache.org/jira/browse/FLINK-19635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233629#comment-17233629
]
Leonard Xu commented on FLINK-19635:
------------------------------------
{code:java}
{code}
04:35:14,975 [hbase-upsert-sink-flusher-thread-1] ERROR
org.apache.hadoop.hbase.client.AsyncProcess [] - Failed to get region location
org.apache.hadoop.hbase.DoNotRetryIOException: hconnection-0x932344b closed at
org.apache.hadoop.hbase.client.ConnectionImplementation.checkClosed(ConnectionImplementation.java:591)
~[hbase-client-2.2.3.jar:2.2.3] at
org.apache.hadoop.hbase.client.ConnectionImplementation.locateRegion(ConnectionImplementation.java:776)
~[hbase-client-2.2.3.jar:2.2.3] at
org.apache.hadoop.hbase.client.AsyncProcess.submit(AsyncProcess.java:273)
~[hbase-client-2.2.3.jar:2.2.3] at
org.apache.hadoop.hbase.client.AsyncProcess.submit(AsyncProcess.java:228)
~[hbase-client-2.2.3.jar:2.2.3] at
org.apache.hadoop.hbase.client.BufferedMutatorImpl.doFlush(BufferedMutatorImpl.java:303)
~[hbase-client-2.2.3.jar:2.2.3] at
org.apache.hadoop.hbase.client.BufferedMutatorImpl.flush(BufferedMutatorImpl.java:280)
~[hbase-client-2.2.3.jar:2.2.3] at
org.apache.flink.connector.hbase.sink.HBaseSinkFunction.flush(HBaseSinkFunction.java:189)
~[flink-connector-hbase-base_2.11-1.12-SNAPSHOT.jar:1.12-SNAPSHOT] at
org.apache.flink.connector.hbase.sink.HBaseSinkFunction.lambda$open$0(HBaseSinkFunction.java:134)
~[flink-connector-hbase-base_2.11-1.12-SNAPSHOT.jar:1.12-SNAPSHOT] at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
[?:1.8.0_242] at
java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [?:1.8.0_242]
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)
[?:1.8.0_242] at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)
[?:1.8.0_242] at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
[?:1.8.0_242] at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
[?:1.8.0_242] at java.lang.Thread.run(Thread.java:748)
[?:1.8.0_242]04:35:14,975 [flink-akka.actor.default-dispatcher-4] INFO
org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Un-registering task and
sending final execution state FINISHED to JobManager for task Source:
TableSourceScan(table=[[default_catalog, default_database, testTable1]],
fields=[rowkey, family1, family2, family3, family4]) -> Sink:
Sink(table=[default_catalog.default_database.testTable3], fields=[rowkey,
family1, family2, family3, family4]) (1/32)
65f7b61a1994ee9b341a2a7f5c6cca3a_cbc357ccb763df2852fee8c4fc7d55f2_0_0.04:35:14,976
[flink-akka.actor.default-dispatcher-4] INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph [] - Source:
TableSourceScan(table=[[default_catalog, default_database, testTable1]],
fields=[rowkey, family1, family2, family3, family4]) -> Sink:
Sink(table=[default_catalog.default_database.testTable3], fields=[rowkey,
family1, family2, family3, family4]) (1/32)
(65f7b61a1994ee9b341a2a7f5c6cca3a_cbc357ccb763df2852fee8c4fc7d55f2_0_0)
switched from RUNNING to FINISHED.04:35:14,986 [RS-EventLoopGroup-3-54] INFO
SecurityLogger.org.apache.hadoop.hbase.Server
[] - Connection from 192.168.192.2:49496, version=2.2.3, sasl=false,
ugi=agent01_azpcontainer (auth:SIMPLE), service=ClientService04:35:15,016 [
htable-pool146-t1] WARN org.apache.hadoop.hbase.client.AsyncRequestFutureImpl
[] - id=175, table=testTable3, attempt=1/16, failureCount=1ops, last
exception=org.apache.hadoop.hbase.DoNotRetryIOException: hconnection-0x932344b
closed on 295934f72d17,36709,1602650080067, tracking started Wed Oct 14
04:35:14 UTC 2020; *NOT retrying, failed=1 -- final attempt!*
from the failed log, I think the mismatch result is because one soure task
failed to scan the data, we set the
*hbase.client.retries.number* to "*1*" which means do not retry, but the
default value in hbase-1.4 is "*35*" and in hbase-2.2 is "*15*", so we should
at least allow the hbase client do retry.
> HBaseConnectorITCase.testTableSourceSinkWithDDL is unstable with a result
> mismatch
> ----------------------------------------------------------------------------------
>
> Key: FLINK-19635
> URL: https://issues.apache.org/jira/browse/FLINK-19635
> Project: Flink
> Issue Type: Bug
> Components: Connectors / HBase
> Affects Versions: 1.12.0
> Reporter: Robert Metzger
> Assignee: Leonard Xu
> Priority: Critical
> Labels: test-stability
> Fix For: 1.12.0
>
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=7562&view=logs&j=d44f43ce-542c-597d-bf94-b0718c71e5e8&t=03dca39c-73e8-5aaf-601d-328ae5c35f20
> {code}
> 2020-10-14T04:35:36.9268975Z testTableSourceSinkWithDDL[planner =
> BLINK_PLANNER, legacy =
> false](org.apache.flink.connector.hbase2.HBaseConnectorITCase) Time elapsed:
> 3.131 sec <<< FAILURE!
> 2020-10-14T04:35:36.9276246Z java.lang.AssertionError:
> expected:<[1,10,Hello-1,100,1.01,false,Welt-1,2019-08-18T19:00,2019-08-18,19:00,12345678.0001,
>
> 2,20,Hello-2,200,2.02,true,Welt-2,2019-08-18T19:01,2019-08-18,19:01,12345678.0002,
>
> 3,30,Hello-3,300,3.03,false,Welt-3,2019-08-18T19:02,2019-08-18,19:02,12345678.0003,
>
> 4,40,null,400,4.04,true,Welt-4,2019-08-18T19:03,2019-08-18,19:03,12345678.0004,
>
> 5,50,Hello-5,500,5.05,false,Welt-5,2019-08-19T19:10,2019-08-19,19:10,12345678.0005,
>
> 6,60,Hello-6,600,6.06,true,Welt-6,2019-08-19T19:20,2019-08-19,19:20,12345678.0006,
>
> 7,70,Hello-7,700,7.07,false,Welt-7,2019-08-19T19:30,2019-08-19,19:30,12345678.0007,
>
> 8,80,null,800,8.08,true,Welt-8,2019-08-19T19:40,2019-08-19,19:40,12345678.0008]>
> but
> was:<[1,10,Hello-1,100,1.01,false,Welt-1,2019-08-18T19:00,2019-08-18,19:00,12345678.0001,
>
> 2,20,Hello-2,200,2.02,true,Welt-2,2019-08-18T19:01,2019-08-18,19:01,12345678.0002,
>
> 3,30,Hello-3,300,3.03,false,Welt-3,2019-08-18T19:02,2019-08-18,19:02,12345678.0003]>
> 2020-10-14T04:35:36.9281340Z at org.junit.Assert.fail(Assert.java:88)
> 2020-10-14T04:35:36.9282023Z at
> org.junit.Assert.failNotEquals(Assert.java:834)
> 2020-10-14T04:35:36.9328385Z at
> org.junit.Assert.assertEquals(Assert.java:118)
> 2020-10-14T04:35:36.9338939Z at
> org.junit.Assert.assertEquals(Assert.java:144)
> 2020-10-14T04:35:36.9339880Z at
> org.apache.flink.connector.hbase2.HBaseConnectorITCase.testTableSourceSinkWithDDL(HBaseConnectorITCase.java:449)
> 2020-10-14T04:35:36.9341003Z at
> sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)