Dong Lin created FLINK-25148:
--------------------------------

             Summary: PipelinedSubpartition.resumeConsumption throws 
IllegalStateException
                 Key: FLINK-25148
                 URL: https://issues.apache.org/jira/browse/FLINK-25148
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Checkpointing
    Affects Versions: 1.14.0
            Reporter: Dong Lin


We observed the following stacktrace when running the unit test 
org.apache.flink.ml.clustering.KMeansTest::testFeaturePredictionParam. See 
[https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true].

Note that the stacktrace contains an IllegalStateException from 
org.apache.flink.runtime.io.network.partition.PipelinedSubpartition.resumeConsumption(PipelinedSubpartition.java:381).
 Thus it is likely due to a bug in the Flink runtime.

 

{code}
 
Error:  testFeaturePredictionParam Time elapsed: 2.079 s <<< ERROR! 
[25158|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25158]java.lang.RuntimeException:
 Failed to fetch next result 
[25159|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25159]
 at 
org.apache.flink.streaming.api.operators.collect.CollectResultIterator.nextResultFromFetcher(CollectResultIterator.java:109)
 
[25160|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25160]
 at 
org.apache.flink.streaming.api.operators.collect.CollectResultIterator.hasNext(CollectResultIterator.java:80)
 
[25161|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25161]
 at org.apache.commons.collections.IteratorUtils.toList(IteratorUtils.java:848) 
[25162|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25162]
 at org.apache.commons.collections.IteratorUtils.toList(IteratorUtils.java:825) 
[25163|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25163]
 at 
org.apache.flink.ml.clustering.KMeansTest.executeAndCollect(KMeansTest.java:105)
 
[25164|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25164]
 at 
org.apache.flink.ml.clustering.KMeansTest.testFeaturePredictionParam(KMeansTest.java:162)
 
[25165|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25165]
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 
[25166|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25166]
 at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 
[25167|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25167]
 at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 
[25168|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25168]
 at java.lang.reflect.Method.invoke(Method.java:498) 
[25169|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25169]
 at 
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
 
[25170|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25170]
 at 
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
 
[25171|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25171]
 at 
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
 
[25172|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25172]
 at 
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
 
[25173|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25173]
 at 
org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26) 
[25174|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25174]
 at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27) 
[25175|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25175]
 at org.apache.flink.util.TestNameProvider$1.evaluate(TestNameProvider.java:45) 
[25176|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25176]
 at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55) 
[25177|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25177]
 at org.junit.rules.RunRules.evaluate(RunRules.java:20) 
[25178|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25178]
 at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325) 
[25179|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25179]
 at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
 
[25180|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25180]
 at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
 
[25181|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25181]
 at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290) 
[25182|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25182]
 at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71) 
[25183|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25183]
 at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288) 
[25184|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25184]
 at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58) 
[25185|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25185]
 at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268) 
[25186|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25186]
 at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48) 
[25187|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25187]
 at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48) 
[25188|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25188]
 at org.junit.rules.RunRules.evaluate(RunRules.java:20) 
[25189|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25189]
 at org.junit.runners.ParentRunner.run(ParentRunner.java:363) 
[25190|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25190]
 at org.junit.runner.JUnitCore.run(JUnitCore.java:137) 
[25191|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25191]
 at org.junit.runner.JUnitCore.run(JUnitCore.java:115) 
[25192|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25192]
 at 
org.junit.vintage.engine.execution.RunnerExecutor.execute(RunnerExecutor.java:43)
 
[25193|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25193]
 at java.util.stream.ForEachOps$ForEachOp$OfRef.accept(ForEachOps.java:183) 
[25194|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25194]
 at java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:193) 
[25195|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25195]
 at java.util.Iterator.forEachRemaining(Iterator.java:116) 
[25196|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25196]
 at 
java.util.Spliterators$IteratorSpliterator.forEachRemaining(Spliterators.java:1801)
 
[25197|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25197]
 at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:482) 
[25198|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25198]
 at 
java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:472) 
[25199|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25199]
 at 
java.util.stream.ForEachOps$ForEachOp.evaluateSequential(ForEachOps.java:150) 
[25200|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25200]
 at 
java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateSequential(ForEachOps.java:173)
 
 
[25251|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25251]
 at 
org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:78)
 
[25252|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25252]
 at 
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:163)
 
[25253|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25253]
 at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:24) 
[25254|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25254]
 at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:20) 
[25255|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25255]
 at scala.PartialFunction.applyOrElse(PartialFunction.scala:123) 
[25256|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25256]
 at scala.PartialFunction.applyOrElse$(PartialFunction.scala:122) 
[25257|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25257]
 at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:20) 
[25258|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25258]
 at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171) 
[25259|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25259]
 at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172) 
[25260|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25260]
 at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172) 
[25261|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25261]
 at akka.actor.Actor.aroundReceive(Actor.scala:537) 
[25262|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25262]
 at akka.actor.Actor.aroundReceive$(Actor.scala:535) 
[25263|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25263]
 at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:220) 
[25264|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25264]
 at akka.actor.ActorCell.receiveMessage(ActorCell.scala:580) 
[25265|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25265]
 at akka.actor.ActorCell.invoke(ActorCell.scala:548) 
[25266|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25266]
 at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:270) 
[25267|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25267]
 at akka.dispatch.Mailbox.run(Mailbox.scala:231) 
[25268|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25268]
 at akka.dispatch.Mailbox.exec(Mailbox.scala:243) 
[25269|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25269]
 at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) 
[25270|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25270]
 at java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056) 
[25271|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25271]
 at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692) 
[25272|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25272]
 at 
java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175) 
[25273|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25273]Caused
 by: java.lang.IllegalStateException: Should be blocked by checkpoint. 
[25274|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25274]
 at org.apache.flink.util.Preconditions.checkState(Preconditions.java:193) 
[25275|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25275]
 at 
org.apache.flink.runtime.io.network.partition.PipelinedSubpartition.resumeConsumption(PipelinedSubpartition.java:381)
 
[25276|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25276]
 at 
org.apache.flink.runtime.io.network.partition.PipelinedSubpartitionView.resumeConsumption(PipelinedSubpartitionView.java:79)
 
[25277|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25277]
 at 
org.apache.flink.runtime.io.network.partition.consumer.LocalInputChannel.resumeConsumption(LocalInputChannel.java:283)
 
[25278|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25278]
 at 
org.apache.flink.runtime.io.network.partition.consumer.SingleInputGate.resumeConsumption(SingleInputGate.java:867)
 
[25279|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25279]
 at 
org.apache.flink.runtime.taskmanager.InputGateWithMetrics.resumeConsumption(InputGateWithMetrics.java:67)
 
[25280|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25280]
 at 
org.apache.flink.streaming.runtime.io.checkpointing.SingleCheckpointBarrierHandler.processBarrier(SingleCheckpointBarrierHandler.java:223)
 
[25281|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25281]
 at 
org.apache.flink.streaming.runtime.tasks.StreamTask.triggerUnfinishedChannelsCheckpoint(StreamTask.java:1209)
 
[25282|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25282]
 at 
org.apache.flink.streaming.runtime.tasks.StreamTask.lambda$triggerCheckpointAsync$12(StreamTask.java:1126)
 
[25283|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25283]
 at 
org.apache.flink.streaming.runtime.tasks.StreamTaskActionExecutor$1.runThrowing(StreamTaskActionExecutor.java:50)
 
[25284|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25284]
 at org.apache.flink.streaming.runtime.tasks.mailbox.Mail.run(Mail.java:90) 
[25285|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25285]
 at 
org.apache.flink.streaming.runtime.tasks.mailbox.MailboxExecutorImpl.yield(MailboxExecutorImpl.java:86)
 
[25286|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25286]
 at 
org.apache.flink.iteration.operator.HeadOperator.endInput(HeadOperator.java:408)
 
[25287|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25287]
 at 
org.apache.flink.streaming.runtime.tasks.StreamOperatorWrapper.endOperatorInput(StreamOperatorWrapper.java:91)
 
[25288|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25288]
 at 
org.apache.flink.streaming.runtime.tasks.RegularOperatorChain.endInput(RegularOperatorChain.java:100)
 
[25289|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25289]
 at 
org.apache.flink.streaming.runtime.io.StreamOneInputProcessor.processInput(StreamOneInputProcessor.java:68)
 
[25290|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25290]
 at 
org.apache.flink.streaming.runtime.tasks.StreamTask.processInput(StreamTask.java:496)
 
[25291|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25291]
 at 
org.apache.flink.streaming.runtime.tasks.mailbox.MailboxProcessor.runMailboxLoop(MailboxProcessor.java:203)
 
[25292|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25292]
 at 
org.apache.flink.streaming.runtime.tasks.StreamTask.runMailboxLoop(StreamTask.java:809)
 
[25293|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25293]
 at 
org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:761) 
[25294|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25294]
 at 
org.apache.flink.runtime.taskmanager.Task.runWithSystemExitMonitoring(Task.java:958)
 
[25295|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25295]
 at org.apache.flink.runtime.taskmanager.Task.restoreAndInvoke(Task.java:937) 
[25296|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25296]
 at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:766) 
[25297|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25297]
 at org.apache.flink.runtime.taskmanager.Task.run(Task.java:575) 
[25298|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25298]
 at java.lang.Thread.run(Thread.java:748) 
[25299|https://github.com/apache/flink-ml/runs/4379659665?check_suite_focus=true#step:4:25299]
{code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to