[
https://issues.apache.org/jira/browse/FLINK-34643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17828316#comment-17828316
]
Ryan Skraba commented on FLINK-34643:
-------------------------------------
Weird – yesterday I collected a lot of build logs from over the weekend that
resemble this error, but apparently my comment was never added :/ I'll go
back and find those links.
In the meantime, [~roman]: we are still seeing failures in the same test that
seem closely related to this issue. Is it possible that the fix is incomplete
and this issue should be reopened, or would you prefer that I raise a new JIRA?
* [https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58398&view=logs&j=8fd9202e-fd17-5b26-353c-ac1ff76c8f28&t=ea7cf968-e585-52cb-e0fc-f48de023a7ca&l=8249]
{code:java}
Mar 19 01:23:06 [not all expected events logged by
org.apache.flink.runtime.jobmaster.JobMaster, logged:
Mar 19 01:23:06 [Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO
Message=Initializing job 'Flink Streaming Job'
(2ef7e557551a93ef716b6c3ba580bcd6).,
Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Using
restart back off time strategy NoRestartBackoffTimeStrategy for Flink Streaming
Job (2ef7e557551a93ef716b6c3ba580bcd6).,
Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Starting
execution of job 'Flink Streaming Job' (2ef7e557551a93ef716b6c3ba580bcd6) under
job master id 90514ce7689864236ebeb94380dc474d.,
Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=DEBUG Message=Trigger
heartbeat request., Logger=org.apache.flink.runtime.jobmaster.JobMaster
Level=INFO Message=Connecting to ResourceManager
pekko://flink/user/rpc/resourcemanager_1(8eee414f9dea640cb3668826c12e4976),
Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Resolved
ResourceManager address, beginning registration,
Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=DEBUG
Message=Registration at ResourceManager attempt 1 (timeout=100ms),
Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=DEBUG
Message=Registration with ResourceManager at
pekko://flink/user/rpc/resourcemanager_1 was successful.,
Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO
Message=JobManager successfully registered at ResourceManager, leader id:
8eee414f9dea640cb3668826c12e4976.,
Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Stopping
the JobMaster for job 'Flink Streaming Job'
(2ef7e557551a93ef716b6c3ba580bcd6).,
Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO
Message=Disconnect TaskExecutor 23ae1952-8d6f-476e-b23b-4fad48feec15 because:
Stopping JobMaster for job 'Flink Streaming Job'
(2ef7e557551a93ef716b6c3ba580bcd6).,
Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=DEBUG Message=Close
ResourceManager connection 58e840ebb5c16d7fb17f233b9e93cb3c.]]
Mar 19 01:23:06 Expecting empty but was: [Checkpoint storage is set to .*,
Mar 19 01:23:06 Running initialization on master for job .*,
Mar 19 01:23:06 Starting scheduling.*,
Mar 19 01:23:06 State backend is set to .*,
Mar 19 01:23:06 Successfully created execution graph from job graph .*,
Mar 19 01:23:06 Successfully ran initialization on master.*,
Mar 19 01:23:06 Triggering a manual checkpoint for job .*.,
Mar 19 01:23:06 Using failover strategy .*]
Mar 19 01:23:06 at
org.apache.flink.test.misc.JobIDLoggingITCase.assertJobIDPresent(JobIDLoggingITCase.java:241)
Mar 19 01:23:06 at
org.apache.flink.test.misc.JobIDLoggingITCase.testJobIDLogging(JobIDLoggingITCase.java:170)
Mar 19 01:23:06 at java.lang.reflect.Method.invoke(Method.java:498)
Mar 19 01:23:06 at
java.util.concurrent.RecursiveAction.exec(RecursiveAction.java:189)
Mar 19 01:23:06 at
java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289)
Mar 19 01:23:06 at
java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056)
Mar 19 01:23:06 at
java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692)
Mar 19 01:23:06 at
java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175)
{code}
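For anyone reading the output above: the "Expecting empty but was" list looks like the set of expected message patterns that no captured JobMaster event matched. Below is a minimal sketch of that kind of check, assuming each pattern is a regex that must match a whole log message. The class and method names are hypothetical, the "Initializing job .*" pattern is made up for illustration, and this is not the actual JobIDLoggingITCase code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Flink: reports which expected
// message patterns were never matched by any captured log message.
public class UnmatchedPatterns {

    /** Returns the expected regexes that no logged message fully matches. */
    static List<String> findUnmatched(List<String> expectedPatterns, List<String> loggedMessages) {
        List<String> unmatched = new ArrayList<>();
        for (String regex : expectedPatterns) {
            Pattern pattern = Pattern.compile(regex);
            boolean seen = loggedMessages.stream()
                    .anyMatch(message -> pattern.matcher(message).matches());
            if (!seen) {
                unmatched.add(regex);
            }
        }
        return unmatched;
    }

    public static void main(String[] args) {
        // Two of the messages that were captured in the failing run.
        List<String> logged = Arrays.asList(
                "Initializing job 'Flink Streaming Job' (2ef7e557551a93ef716b6c3ba580bcd6).",
                "Stopping the JobMaster for job 'Flink Streaming Job' (2ef7e557551a93ef716b6c3ba580bcd6).");

        // Two patterns taken from the failure output, plus one illustrative pattern.
        List<String> expected = Arrays.asList(
                "Initializing job .*",
                "Starting scheduling.*",
                "Using failover strategy .*");

        // A passing test would see an empty list here.
        System.out.println(findUnmatched(expected, logged));
    }
}
```

Read this way, the run above captured the JobMaster start-up and shutdown messages but none of the scheduling or checkpoint messages, so every pattern in the "Expecting empty" list was left unmatched.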
> JobIDLoggingITCase failed
> -------------------------
>
> Key: FLINK-34643
> URL: https://issues.apache.org/jira/browse/FLINK-34643
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.20.0
> Reporter: Matthias Pohl
> Assignee: Roman Khachatryan
> Priority: Major
> Labels: pull-request-available, test-stability
> Fix For: 1.20.0
>
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58187&view=logs&j=8fd9202e-fd17-5b26-353c-ac1ff76c8f28&t=ea7cf968-e585-52cb-e0fc-f48de023a7ca&l=7897
> {code}
> Mar 09 01:24:23 01:24:23.498 [ERROR] Tests run: 1, Failures: 0, Errors: 1,
> Skipped: 0, Time elapsed: 4.209 s <<< FAILURE! -- in
> org.apache.flink.test.misc.JobIDLoggingITCase
> Mar 09 01:24:23 01:24:23.498 [ERROR]
> org.apache.flink.test.misc.JobIDLoggingITCase.testJobIDLogging(ClusterClient)
> -- Time elapsed: 1.459 s <<< ERROR!
> Mar 09 01:24:23 java.lang.IllegalStateException: Too few log events recorded
> for org.apache.flink.runtime.jobmaster.JobMaster (12) - this must be a bug in
> the test code
> Mar 09 01:24:23 at
> org.apache.flink.util.Preconditions.checkState(Preconditions.java:215)
> Mar 09 01:24:23 at
> org.apache.flink.test.misc.JobIDLoggingITCase.assertJobIDPresent(JobIDLoggingITCase.java:148)
> Mar 09 01:24:23 at
> org.apache.flink.test.misc.JobIDLoggingITCase.testJobIDLogging(JobIDLoggingITCase.java:132)
> Mar 09 01:24:23 at java.lang.reflect.Method.invoke(Method.java:498)
> Mar 09 01:24:23 at
> java.util.concurrent.RecursiveAction.exec(RecursiveAction.java:189)
> Mar 09 01:24:23 at
> java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289)
> Mar 09 01:24:23 at
> java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056)
> Mar 09 01:24:23 at
> java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692)
> Mar 09 01:24:23 at
> java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175)
> Mar 09 01:24:23
> {code}
> The other test failures of this build were also caused by the same test:
> * https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58187&view=logs&j=2c3cbe13-dee0-5837-cf47-3053da9a8a78&t=b78d9d30-509a-5cea-1fef-db7abaa325ae&l=8349
> * https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58187&view=logs&j=a596f69e-60d2-5a4b-7d39-dc69e4cdaed3&t=712ade8c-ca16-5b76-3acd-14df33bc1cb1&l=8209