[
https://issues.apache.org/jira/browse/FLINK-17006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Metzger closed FLINK-17006.
----------------------------------
Resolution: Cannot Reproduce
No more cases of this failure. Closing for now. Please reopen if there are new
cases.
> Various integration tests in table module fail with FileNotFoundException (in
> Rocksdb)
> --------------------------------------------------------------------------------------
>
> Key: FLINK-17006
> URL: https://issues.apache.org/jira/browse/FLINK-17006
> Project: Flink
> Issue Type: Bug
> Components: Runtime / State Backends, Table SQL / Runtime, Tests
> Affects Versions: 1.11.0
> Reporter: Robert Metzger
> Priority: Major
> Labels: test-stability
>
> AggregateITCase.testDistinctGroupBy fails with FileNotFoundException (in
> Rocksdb)
> CI run:
> [https://dev.azure.com/rmetzger/Flink/_build/results?buildId=7045&view=logs&j=e25d5e7e-2a9c-5589-4940-0b638d75a414&t=294c2388-20e6-57a2-5721-91db544b1e69]
> Log output:
> {code:java}
> 2020-04-03T17:17:44.4036304Z [ERROR] Tests run: 234, Failures: 0, Errors: 1,
> Skipped: 6, Time elapsed: 155.577 s <<< FAILURE! - in
> org.apache.flink.table.planner.runtime.stream.sql.AggregateITCase
> 2020-04-03T17:17:44.4038781Z [ERROR] testDistinctGroupBy[LocalGlobal=OFF,
> MiniBatch=ON,
> StateBackend=ROCKSDB](org.apache.flink.table.planner.runtime.stream.sql.AggregateITCase)
> Time elapsed: 0.456 s <<< ERROR!
> 2020-04-03T17:17:44.4040384Z
> org.apache.flink.runtime.client.JobExecutionException: Job execution failed.
> 2020-04-03T17:17:44.4041520Z at
> org.apache.flink.runtime.jobmaster.JobResult.toJobExecutionResult(JobResult.java:147)
> 2020-04-03T17:17:44.4042712Z at
> org.apache.flink.runtime.minicluster.MiniCluster.executeJobBlocking(MiniCluster.java:659)
> 2020-04-03T17:17:44.4043972Z at
> org.apache.flink.streaming.util.TestStreamEnvironment.execute(TestStreamEnvironment.java:77)
> 2020-04-03T17:17:44.4045540Z at
> org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:1644)
> 2020-04-03T17:17:44.4047015Z at
> org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:1626)
> 2020-04-03T17:17:44.4048576Z at
> org.apache.flink.streaming.api.scala.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.scala:673)
> 2020-04-03T17:17:44.4050073Z at
> org.apache.flink.table.planner.runtime.stream.sql.AggregateITCase.testDistinctGroupBy(AggregateITCase.scala:172)
> 2020-04-03T17:17:44.4051200Z at
> sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 2020-04-03T17:17:44.4052171Z at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 2020-04-03T17:17:44.4053308Z at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 2020-04-03T17:17:44.4054322Z at
> java.lang.reflect.Method.invoke(Method.java:498)
> 2020-04-03T17:17:44.4055410Z at
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
> 2020-04-03T17:17:44.4056570Z at
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
> 2020-04-03T17:17:44.4057800Z at
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
> 2020-04-03T17:17:44.4059019Z at
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
> 2020-04-03T17:17:44.4060178Z at
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
> 2020-04-03T17:17:44.4061261Z at
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
> 2020-04-03T17:17:44.4062617Z at
> org.junit.rules.ExpectedException$ExpectedExceptionStatement.evaluate(ExpectedException.java:239)
> 2020-04-03T17:17:44.4063782Z at
> org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48)
> 2020-04-03T17:17:44.4064838Z at
> org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55)
> 2020-04-03T17:17:44.4065742Z at
> org.junit.rules.RunRules.evaluate(RunRules.java:20)
> 2020-04-03T17:17:44.4066636Z at
> org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
> 2020-04-03T17:17:44.4067762Z at
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
> 2020-04-03T17:17:44.4068895Z at
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
> 2020-04-03T17:17:44.4069978Z at
> org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> 2020-04-03T17:17:44.4070920Z at
> org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> 2020-04-03T17:17:44.4071901Z at
> org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
> 2020-04-03T17:17:44.4072875Z at
> org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> 2020-04-03T17:17:44.4073850Z at
> org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
> 2020-04-03T17:17:44.4074854Z at
> org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> 2020-04-03T17:17:44.4075729Z at
> org.junit.runners.Suite.runChild(Suite.java:128)
> 2020-04-03T17:17:44.4076541Z at
> org.junit.runners.Suite.runChild(Suite.java:27)
> 2020-04-03T17:17:44.4077479Z at
> org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> 2020-04-03T17:17:44.4078422Z at
> org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> 2020-04-03T17:17:44.4079501Z at
> org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
> 2020-04-03T17:17:44.4080503Z at
> org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> 2020-04-03T17:17:44.4081483Z at
> org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
> 2020-04-03T17:17:44.4082477Z at
> org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48)
> 2020-04-03T17:17:44.4083522Z at
> org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48)
> 2020-04-03T17:17:44.4084529Z at
> org.junit.rules.RunRules.evaluate(RunRules.java:20)
> 2020-04-03T17:17:44.4085420Z at
> org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> 2020-04-03T17:17:44.4086433Z at
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
> 2020-04-03T17:17:44.4087696Z at
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:273)
> 2020-04-03T17:17:44.4088900Z at
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:238)
> 2020-04-03T17:17:44.4090109Z at
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
> 2020-04-03T17:17:44.4091331Z at
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:384)
> 2020-04-03T17:17:44.4092600Z at
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:345)
> 2020-04-03T17:17:44.4093737Z at
> org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:126)
> 2020-04-03T17:17:44.4094894Z at
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:418)
> 2020-04-03T17:17:44.4096257Z Caused by:
> org.apache.flink.runtime.JobException: Recovery is suppressed by
> FixedDelayRestartBackoffTimeStrategy(maxNumberRestartAttempts=1,
> backoffTimeMS=0)
> 2020-04-03T17:17:44.4097915Z at
> org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.handleFailure(ExecutionFailureHandler.java:110)
> 2020-04-03T17:17:44.4099539Z at
> org.apache.flink.runtime.executiongraph.failover.flip1.ExecutionFailureHandler.getFailureHandlingResult(ExecutionFailureHandler.java:76)
> 2020-04-03T17:17:44.4101039Z at
> org.apache.flink.runtime.scheduler.DefaultScheduler.handleTaskFailure(DefaultScheduler.java:190)
> 2020-04-03T17:17:44.4102353Z at
> org.apache.flink.runtime.scheduler.DefaultScheduler.maybeHandleTaskFailure(DefaultScheduler.java:184)
> 2020-04-03T17:17:44.4103808Z at
> org.apache.flink.runtime.scheduler.DefaultScheduler.updateTaskExecutionStateInternal(DefaultScheduler.java:178)
> 2020-04-03T17:17:44.4105242Z at
> org.apache.flink.runtime.scheduler.SchedulerBase.updateTaskExecutionState(SchedulerBase.java:505)
> 2020-04-03T17:17:44.4106478Z at
> org.apache.flink.runtime.jobmaster.JobMaster.updateTaskExecutionState(JobMaster.java:384)
> 2020-04-03T17:17:44.4107561Z at
> sun.reflect.GeneratedMethodAccessor16.invoke(Unknown Source)
> 2020-04-03T17:17:44.4108552Z at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 2020-04-03T17:17:44.4109560Z at
> java.lang.reflect.Method.invoke(Method.java:498)
> 2020-04-03T17:17:44.4110604Z at
> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcInvocation(AkkaRpcActor.java:284)
> 2020-04-03T17:17:44.4111812Z at
> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:199)
> 2020-04-03T17:17:44.4113054Z at
> org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:74)
> 2020-04-03T17:17:44.4114282Z at
> org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:152)
> 2020-04-03T17:17:44.4115417Z at
> akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26)
> 2020-04-03T17:17:44.4116357Z at
> akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21)
> 2020-04-03T17:17:44.4117338Z at
> scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123)
> 2020-04-03T17:17:44.4118401Z at
> akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21)
> 2020-04-03T17:17:44.4119414Z at
> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170)
> 2020-04-03T17:17:44.4120443Z at
> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
> 2020-04-03T17:17:44.4121538Z at
> scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
> 2020-04-03T17:17:44.4122461Z at
> akka.actor.Actor$class.aroundReceive(Actor.scala:517)
> 2020-04-03T17:17:44.4123383Z at
> akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)
> 2020-04-03T17:17:44.4124315Z at
> akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)
> 2020-04-03T17:17:44.4125257Z at
> akka.actor.ActorCell.invoke(ActorCell.scala:561)
> 2020-04-03T17:17:44.4126107Z at
> akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)
> 2020-04-03T17:17:44.4126953Z at akka.dispatch.Mailbox.run(Mailbox.scala:225)
> 2020-04-03T17:17:44.4127798Z at akka.dispatch.Mailbox.exec(Mailbox.scala:235)
> 2020-04-03T17:17:44.4128829Z at
> akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> 2020-04-03T17:17:44.4129875Z at
> akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> 2020-04-03T17:17:44.4130957Z at
> akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> 2020-04-03T17:17:44.4132016Z at
> akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> 2020-04-03T17:17:44.4133098Z Caused by: java.lang.Exception: Exception while
> creating StreamOperatorStateContext.
> 2020-04-03T17:17:44.4134528Z at
> org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.streamOperatorStateContext(StreamTaskStateInitializerImpl.java:191)
> 2020-04-03T17:17:44.4136100Z at
> org.apache.flink.streaming.api.operators.AbstractStreamOperator.initializeState(AbstractStreamOperator.java:246)
> 2020-04-03T17:17:44.4137580Z at
> org.apache.flink.streaming.runtime.tasks.OperatorChain.initializeStateAndOpenOperators(OperatorChain.java:293)
> 2020-04-03T17:17:44.4138925Z at
> org.apache.flink.streaming.runtime.tasks.StreamTask.lambda$beforeInvoke$0(StreamTask.java:436)
> 2020-04-03T17:17:44.4140304Z at
> org.apache.flink.streaming.runtime.tasks.StreamTaskActionExecutor$1.runThrowing(StreamTaskActionExecutor.java:47)
> 2020-04-03T17:17:44.4141631Z at
> org.apache.flink.streaming.runtime.tasks.StreamTask.beforeInvoke(StreamTask.java:432)
> 2020-04-03T17:17:44.4142783Z at
> org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:445)
> 2020-04-03T17:17:44.4143814Z at
> org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:718)
> 2020-04-03T17:17:44.4144935Z at
> org.apache.flink.runtime.taskmanager.Task.run(Task.java:542)
> 2020-04-03T17:17:44.4145756Z at java.lang.Thread.run(Thread.java:748)
> 2020-04-03T17:17:44.4147095Z Caused by: org.apache.flink.util.FlinkException:
> Could not restore keyed state backend for
> KeyedMapBundleOperator_f6dc7f4d2283f4605b127b9364e21148_(2/4) from any of the
> 1 provided restore options.
> 2020-04-03T17:17:44.4148863Z at
> org.apache.flink.streaming.api.operators.BackendRestorerProcedure.createAndRestore(BackendRestorerProcedure.java:135)
> 2020-04-03T17:17:44.4150449Z at
> org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.keyedStatedBackend(StreamTaskStateInitializerImpl.java:304)
> 2020-04-03T17:17:44.4152096Z at
> org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.streamOperatorStateContext(StreamTaskStateInitializerImpl.java:131)
> 2020-04-03T17:17:44.4153151Z ... 9 more
> 2020-04-03T17:17:44.4153946Z Caused by:
> org.apache.flink.runtime.state.BackendBuildingException: Caught unexpected
> exception.
> 2020-04-03T17:17:44.4155379Z at
> org.apache.flink.contrib.streaming.state.RocksDBKeyedStateBackendBuilder.build(RocksDBKeyedStateBackendBuilder.java:336)
> 2020-04-03T17:17:44.4156850Z at
> org.apache.flink.contrib.streaming.state.RocksDBStateBackend.createKeyedStateBackend(RocksDBStateBackend.java:548)
> 2020-04-03T17:17:44.4158503Z at
> org.apache.flink.streaming.api.operators.StreamTaskStateInitializerImpl.lambda$keyedStatedBackend$1(StreamTaskStateInitializerImpl.java:288)
> 2020-04-03T17:17:44.4160158Z at
> org.apache.flink.streaming.api.operators.BackendRestorerProcedure.attemptCreateAndRestore(BackendRestorerProcedure.java:142)
> 2020-04-03T17:17:44.4161662Z at
> org.apache.flink.streaming.api.operators.BackendRestorerProcedure.createAndRestore(BackendRestorerProcedure.java:121)
> 2020-04-03T17:17:44.4162706Z ... 11 more
> 2020-04-03T17:17:44.4164646Z Caused by: java.io.FileNotFoundException:
> /tmp/junit1553841028375950249/junit3479071836389613442/babdd750dc1c5b3874a0dd55d14a84f6/shared/2aa67d1b-8841-4755-84c4-b891fc8c3352
> (No such file or directory)
> 2020-04-03T17:17:44.4166003Z at java.io.FileInputStream.open0(Native Method)
> 2020-04-03T17:17:44.4166782Z at
> java.io.FileInputStream.open(FileInputStream.java:195)
> 2020-04-03T17:17:44.4167752Z at
> java.io.FileInputStream.<init>(FileInputStream.java:138)
> 2020-04-03T17:17:44.4168802Z at
> org.apache.flink.core.fs.local.LocalDataInputStream.<init>(LocalDataInputStream.java:50)
> 2020-04-03T17:17:44.4170003Z at
> org.apache.flink.core.fs.local.LocalFileSystem.open(LocalFileSystem.java:142)
> 2020-04-03T17:17:44.4171168Z at
> org.apache.flink.core.fs.SafetyNetWrapperFileSystem.open(SafetyNetWrapperFileSystem.java:85)
> 2020-04-03T17:17:44.4172447Z at
> org.apache.flink.runtime.state.filesystem.FileStateHandle.openInputStream(FileStateHandle.java:68)
> 2020-04-03T17:17:44.4173862Z at
> org.apache.flink.contrib.streaming.state.RocksDBStateDownloader.downloadDataForStateHandle(RocksDBStateDownloader.java:126)
> 2020-04-03T17:17:44.4175519Z at
> org.apache.flink.contrib.streaming.state.RocksDBStateDownloader.lambda$createDownloadRunnables$0(RocksDBStateDownloader.java:109)
> 2020-04-03T17:17:44.4176945Z at
> org.apache.flink.util.function.ThrowingRunnable.lambda$unchecked$0(ThrowingRunnable.java:50)
> 2020-04-03T17:17:44.4178185Z at
> java.util.concurrent.CompletableFuture$AsyncRun.run(CompletableFuture.java:1640)
> 2020-04-03T17:17:44.4179404Z at
> org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:211)
> 2020-04-03T17:17:44.4180647Z at
> java.util.concurrent.CompletableFuture.asyncRunStage(CompletableFuture.java:1654)
> 2020-04-03T17:17:44.4181769Z at
> java.util.concurrent.CompletableFuture.runAsync(CompletableFuture.java:1871)
> 2020-04-03T17:17:44.4183098Z at
> org.apache.flink.contrib.streaming.state.RocksDBStateDownloader.downloadDataForAllStateHandles(RocksDBStateDownloader.java:83)
> 2020-04-03T17:17:44.4184752Z at
> org.apache.flink.contrib.streaming.state.RocksDBStateDownloader.transferAllStateDataToDirectory(RocksDBStateDownloader.java:67)
> 2020-04-03T17:17:44.4186598Z at
> org.apache.flink.contrib.streaming.state.restore.RocksDBIncrementalRestoreOperation.transferRemoteStateToLocalDirectory(RocksDBIncrementalRestoreOperation.java:229)
> 2020-04-03T17:17:44.4188548Z at
> org.apache.flink.contrib.streaming.state.restore.RocksDBIncrementalRestoreOperation.restoreFromRemoteState(RocksDBIncrementalRestoreOperation.java:194)
> 2020-04-03T17:17:44.4190380Z at
> org.apache.flink.contrib.streaming.state.restore.RocksDBIncrementalRestoreOperation.restoreWithoutRescaling(RocksDBIncrementalRestoreOperation.java:168)
> 2020-04-03T17:17:44.4192129Z at
> org.apache.flink.contrib.streaming.state.restore.RocksDBIncrementalRestoreOperation.restore(RocksDBIncrementalRestoreOperation.java:154)
> 2020-04-03T17:17:44.4193725Z at
> org.apache.flink.contrib.streaming.state.RocksDBKeyedStateBackendBuilder.build(RocksDBKeyedStateBackendBuilder.java:279)
> 2020-04-03T17:17:44.4194758Z ... 15 more
> 2020-04-03T17:17:44.4195053Z
> {code}
> I'm uncertain about the component assignment of this ticket. This error can
> probably have many causes?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)