[ 
https://issues.apache.org/jira/browse/FLINK-25539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

王康 updated FLINK-25539:
-----------------------
    Attachment: screenshot-1.png

> Create a connector with the batch tableenvironment of flick to read OSS 
> files. The parallelism is set to 16. When reading, there are sometimes thread 
> errors: null IO stream
> ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-25539
>                 URL: https://issues.apache.org/jira/browse/FLINK-25539
>             Project: Flink
>          Issue Type: Bug
>    Affects Versions: 1.13.2
>         Environment: linux:三台机器部署flink-taskmanage,16核内存32g
>            Reporter: 王康
>            Priority: Major
>         Attachments: screenshot-1.png
>
>
> English:
> Null IO stream: use the Flink version 1.13.2, and use the connector to read 
> OSS files,
> When the parallelism of the batchtableenvironment environment is set to 16, 
> there are three machine nodes, and then one of the node servers will 
> sometimes read null IO stream, resulting in job failure
> Chinese:Null IO stream:用的flink版本1.13.2,用连接器去读取oss文件,
> BatchTableEnvironment环境并行度设为16,三台机器节点,然后其中一台节点服务器有时就会出现读数时报错Null IO 
> stream,导致job失败
> {code:java}
> ExecutionEnvironment fbEnv = ExecutionEnvironment.getExecutionEnvironment();
> BatchTableEnvironment tableEnv = BatchTableEnvironment.create(fbEnv);
> fbEnv.setParallelism(16);
> //
> tableEnv.connect(new FileSystem().path(ossPath))
>         .withFormat(new 
> Csv().fieldDelimiter(allTable.getSeparator().charAt(0)))
>         .withSchema(schema)
>         .createTemporaryTable(ossTableName); {code}
> {code:java}
> //以下就是flink报错的具体信息
> org.apache.flink.runtime.JobException: Recovery is suppressed by 
> NoRestartBackoffTimeStrategy   at 
> org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.handleFailure(ExecutionFailureHandler.java:138)
>    at 
> org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.getFailureHandlingResult(ExecutionFailureHandler.java:82)
>    at 
> org.apache.flink.runtime.scheduler.DefaultScheduler.handleTaskFailure(DefaultScheduler.java:216)
>    at 
> org.apache.flink.runtime.scheduler.DefaultScheduler.maybeHandleTaskFailure(DefaultScheduler.java:206)
>    at 
> org.apache.flink.runtime.scheduler.DefaultScheduler.updateTaskExecutionStateInternal(DefaultScheduler.java:197)
>    at 
> org.apache.flink.runtime.scheduler.SchedulerBase.updateTaskExecutionState(SchedulerBase.java:682)
>    at 
> org.apache.flink.runtime.scheduler.SchedulerNG.updateTaskExecutionState(SchedulerNG.java:79)
>    at 
> org.apache.flink.runtime.jobmaster.JobMaster.updateTaskExecutionState(JobMaster.java:435)
>    at sun.reflect.GeneratedMethodAccessor113.invoke(Unknown Source)   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>    at java.lang.reflect.Method.invoke(Method.java:498)   at 
> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcInvocation(AkkaRpcActor.java:305)
>    at 
> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:212)
>    at 
> org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:77)
>    at 
> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:158)
>    at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26)   at 
> akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21)   at 
> scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123)   at 
> akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21)   at 
> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170)   at 
> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)   at 
> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)   at 
> akka.actor.Actor$class.aroundReceive(Actor.scala:517)   at 
> akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)   at 
> akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)   at 
> akka.actor.ActorCell.invoke(ActorCell.scala:561)   at 
> akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)   at 
> akka.dispatch.Mailbox.run(Mailbox.scala:225)   at 
> akka.dispatch.Mailbox.exec(Mailbox.scala:235)   at 
> akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)   at 
> akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339) 
>   at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)   
> at 
> akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
>  Caused by: java.io.IOException: Null IO stream   at 
> org.apache.flink.fs.shaded.hadoop3.org.apache.hadoop.fs.aliyun.oss.AliyunOSSInputStream.reopen(AliyunOSSInputStream.java:176)
>    at 
> org.apache.flink.fs.shaded.hadoop3.org.apache.hadoop.fs.aliyun.oss.AliyunOSSInputStream.read(AliyunOSSInputStream.java:235)
>    at java.io.DataInputStream.read(DataInputStream.java:149)   at 
> org.apache.flink.fs.osshadoop.common.HadoopDataInputStream.read(HadoopDataInputStream.java:96)
>    at 
> org.apache.flink.api.common.io.DelimitedInputFormat.fillBuffer(DelimitedInputFormat.java:742)
>    at 
> org.apache.flink.api.common.io.DelimitedInputFormat.readLine(DelimitedInputFormat.java:586)
>    at 
> org.apache.flink.api.common.io.DelimitedInputFormat.open(DelimitedInputFormat.java:505)
>    at 
> org.apache.flink.api.common.io.DelimitedInputFormat.open(DelimitedInputFormat.java:50)
>    at 
> org.apache.flink.runtime.operators.DataSourceTask.invoke(DataSourceTask.java:183)
>    at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:779)   at 
> org.apache.flink.runtime.taskmanager.Task.run(Task.java:566)   at 
> java.lang.Thread.run(Thread.java:745)
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to