[
https://issues.apache.org/jira/browse/FLINK-26568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17504084#comment-17504084
]
Matthias Pohl commented on FLINK-26568:
---------------------------------------
One error I could extract is the following one:
{code}
org.apache.flink.runtime.jobmaster.ExecutionGraphException: The execution
attempt 88043f67dcf0f1684c00d3d509ef677b was not found.
at
org.apache.flink.runtime.jobmaster.JobMaster.updateTaskExecutionState(JobMaster.java:449)
~[flink-runtime-1.15-SNAPSHOT.jar:1.15-SNAPSHOT]
at sun.reflect.GeneratedMethodAccessor24.invoke(Unknown Source) ~[?:?]
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
~[?:1.8.0_292]
at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_292]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.lambda$handleRpcInvocation$1(AkkaRpcActor.java:304)
~[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at
org.apache.flink.runtime.concurrent.akka.ClassLoadingUtils.runWithContextClassLoader(ClassLoadingUtils.java:83)
~[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcInvocation(AkkaRpcActor.java:302)
~[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(AkkaRpcActor.java:217)
~[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at
org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.handleRpcMessage(FencedAkkaRpcActor.java:78)
~[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(AkkaRpcActor.java:163)
~[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:24)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:20)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at scala.PartialFunction.applyOrElse(PartialFunction.scala:123)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at scala.PartialFunction.applyOrElse$(PartialFunction.scala:122)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:20)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:172)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.actor.Actor.aroundReceive(Actor.scala:537)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.actor.Actor.aroundReceive$(Actor.scala:535)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:220)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:580)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.actor.ActorCell.invoke(ActorCell.scala:548)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:270)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.dispatch.Mailbox.run(Mailbox.scala:231)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at akka.dispatch.Mailbox.exec(Mailbox.scala:243)
[flink-rpc-akka_51c96179-864e-49ba-90fd-85361d225b91.jar:1.15-SNAPSHOT]
at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289)
[?:1.8.0_292]
at
java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056)
[?:1.8.0_292]
at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692)
[?:1.8.0_292]
at
java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175)
[?:1.8.0_292]
{code}
It's hard to identify the relevant logs because of the log file being so large
(12G). The test itself isn't extended with {{TestLogger}} which makes it harder
to find the right log lines.
> BlockingShuffleITCase.testDeletePartitionFileOfBoundedBlockingShuffle timing
> out on Azure
> -----------------------------------------------------------------------------------------
>
> Key: FLINK-26568
> URL: https://issues.apache.org/jira/browse/FLINK-26568
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Task, Tests
> Affects Versions: 1.15.0
> Reporter: Matthias Pohl
> Priority: Critical
> Fix For: 1.15.0
>
>
> [This
> build|https://dev.azure.com/mapohl/flink/_build/results?buildId=845&view=logs&j=0a15d512-44ac-5ba5-97ab-13a5d066c22c&t=9a028d19-6c4b-5a4e-d378-03fca149d0b1&l=12865]
> timed out due the test
> {{BlockingShuffleITCase.testDeletePartitionFileOfBoundedBlockingShuffle}} not
> finishing.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)