[
https://issues.apache.org/jira/browse/FLINK-38272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18023462#comment-18023462
]
Lei Yang edited comment on FLINK-38272 at 9/30/25 10:07 AM:
------------------------------------------------------------
The issue occurred because the task performed a state transition before the
deploying phase had completed, triggering an illegal state exception that caused
the task to restart repeatedly. At the same time, jobEventStore.stop(false)
was manually invoked, which led to an NPE during the task restart process. To
fix this, we can defer the state transition until all tasks have been deployed.
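
The idea of deferring the transition can be sketched roughly as follows. This
is only an illustration of the approach, not the actual patch: the class
DeferredTransitionSketch, its Task type and every method name are hypothetical
and do not correspond to Flink classes. Each task exposes a future that
completes when its deployment finishes, and the transition runs only after all
of those futures are done.

```java
import java.util.List;
import java.util.concurrent.CompletableFuture;

// Hypothetical sketch: none of these types are Flink classes.
final class DeferredTransitionSketch {

    enum TaskState { CREATED, DEPLOYING, RUNNING }

    static final class Task {
        private TaskState state = TaskState.CREATED;
        // Completed once this task's deployment has finished.
        private final CompletableFuture<Void> deployed = new CompletableFuture<>();

        void startDeploying() {
            state = TaskState.DEPLOYING;
        }

        void completeDeployment() {
            deployed.complete(null);
        }

        // Transitioning while deployment is still pending is the failure mode
        // described above, so this sketch rejects it explicitly.
        void transitionToRunning() {
            if (!deployed.isDone()) {
                throw new IllegalStateException("Task is still deploying");
            }
            state = TaskState.RUNNING;
        }

        TaskState state() {
            return state;
        }

        CompletableFuture<Void> deployedFuture() {
            return deployed;
        }
    }

    // Defers the state transition until every task has reported deployment.
    static CompletableFuture<Void> transitionAfterAllDeployed(List<Task> tasks) {
        CompletableFuture<?>[] pending =
                tasks.stream().map(Task::deployedFuture).toArray(CompletableFuture[]::new);
        return CompletableFuture.allOf(pending)
                .thenRun(() -> tasks.forEach(Task::transitionToRunning));
    }

    public static void main(String[] args) {
        List<Task> tasks = List.of(new Task(), new Task());
        tasks.forEach(Task::startDeploying);

        CompletableFuture<Void> done = transitionAfterAllDeployed(tasks);

        tasks.get(0).completeDeployment();
        // One deployment is still pending, so nothing has transitioned yet.
        System.out.println(tasks.get(0).state());

        tasks.get(1).completeDeployment();
        // All deployments are complete, so both tasks have transitioned.
        System.out.println(tasks.get(0).state() + " " + done.isDone());
    }
}
```

In this sketch a test would wait on the returned future before stopping any
shared component, so that no task is mid-transition when the stop happens.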
was (Author: noorall):
The issue occurred because the JM failover triggered a task restart. At the
same time, {{jobEventStore.stop(false)}} was manually executed, which led to an
NPE during the task restart process. To fix this, we can cancel the task before
invoking {{jobEventStore.stop(false)}} so as to avoid the restart.
> The BatchJobRecoveryTest failed of azure cron adaptive scheduler pipeline
> -------------------------------------------------------------------------
>
> Key: FLINK-38272
> URL: https://issues.apache.org/jira/browse/FLINK-38272
> Project: Flink
> Issue Type: Bug
> Components: Tests
> Reporter: Ruan Hang
> Assignee: Lei Yang
> Priority: Major
> Labels: pull-request-available
> Fix For: 2.2.0
>
>
> Details:
> [https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=69413&view=logs&j=77a9d8e1-d610-59b3-fc2a-4766541e0e33&t=25baecb7-cea0-597a-6b01-188b1478210d]
> [https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=69435&view=logs&j=77a9d8e1-d610-59b3-fc2a-4766541e0e33&t=25baecb7-cea0-597a-6b01-188b1478210d]
> {code:java}
> Aug 19 03:11:00 03:11:00.154 [ERROR] Error occurred in starting fork, check output in log
> Aug 19 03:11:00 03:11:00.154 [ERROR] Process Exit Code: 239
> Aug 19 03:11:00 03:11:00.154 [ERROR] Crashed tests:
> Aug 19 03:11:00 03:11:00.154 [ERROR] org.apache.flink.runtime.scheduler.adaptivebatch.BatchJobRecoveryTest
> Aug 19 03:11:00 03:11:00.154 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter.awaitResultsDone(ForkStarter.java:456)
> Aug 19 03:11:00 03:11:00.154 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter.runSuitesForkOnceMultiple(ForkStarter.java:358)
> Aug 19 03:11:00 03:11:00.154 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter.run(ForkStarter.java:296)
> Aug 19 03:11:00 03:11:00.154 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter.run(ForkStarter.java:250)
> Aug 19 03:11:00 03:11:00.154 [ERROR] at org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeProvider(AbstractSurefireMojo.java:1240)
> Aug 19 03:11:00 03:11:00.154 [ERROR] at org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeAfterPreconditionsChecked(AbstractSurefireMojo.java:1089)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.plugin.surefire.AbstractSurefireMojo.execute(AbstractSurefireMojo.java:905)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo(DefaultBuildPluginManager.java:137)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.lifecycle.internal.MojoExecutor.doExecute2(MojoExecutor.java:370)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.lifecycle.internal.MojoExecutor.doExecute(MojoExecutor.java:351)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:215)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:171)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:163)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:117)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:81)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build(SingleThreadedBuilder.java:56)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.lifecycle.internal.LifecycleStarter.execute(LifecycleStarter.java:128)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:294)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:192)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.DefaultMaven.execute(DefaultMaven.java:105)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.cli.MavenCli.execute(MavenCli.java:960)
> Aug 19 03:11:00 03:11:00.155 [ERROR] at org.apache.maven.cli.MavenCli.doMain(MavenCli.java:293)
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)