[
https://issues.apache.org/jira/browse/FLINK-26105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17493772#comment-17493772
]
Matthias Pohl commented on FLINK-26105:
---------------------------------------
I updated the title and added further affected versions because this issue
also occurs in older versions of Flink.
> Rolling log filenames cause end-to-end test to fail (example test failure
> "Running HA (hashmap, async)")
> --------------------------------------------------------------------------------------------------------
>
> Key: FLINK-26105
> URL: https://issues.apache.org/jira/browse/FLINK-26105
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.15.0, 1.13.6, 1.14.3
> Reporter: Yun Gao
> Assignee: Matthias Pohl
> Priority: Critical
> Labels: pull-request-available, test-stability
>
> {code:java}
> Feb 14 01:31:29 Killed TM @ 255483
> Feb 14 01:31:29 Starting new TM.
> Feb 14 01:31:42 Killed TM @ 258722
> Feb 14 01:31:42 Starting new TM.
> Feb 14 01:32:00 Checking for non-empty .out files...
> Feb 14 01:32:00 No non-empty .out files.
> Feb 14 01:32:00 FAILURE: A JM did not take over.
> Feb 14 01:32:00 One or more tests FAILED.
> Feb 14 01:32:00 Stopping job timeout watchdog (with pid=250820)
> Feb 14 01:32:00 Killing JM watchdog @ 252644
> Feb 14 01:32:00 Killing TM watchdog @ 253262
> Feb 14 01:32:00 [FAIL] Test script contains errors.
> Feb 14 01:32:00 Checking of logs skipped.
> Feb 14 01:32:00
> Feb 14 01:32:00 [FAIL] 'Running HA (hashmap, async) end-to-end test' failed after 2 minutes and 51 seconds! Test exited with exit code 1
> Feb 14 01:32:00
> 01:32:00 ##[group]Environment Information
> Feb 14 01:32:01 Searching for .dump, .dumpstream and related files in '/home/vsts/work/1/s'
> dmesg: read kernel buffer failed: Operation not permitted
> Feb 14 01:32:06 Stopping taskexecutor daemon (pid: 259377) on host fv-az313-602.
> Feb 14 01:32:07 Stopping standalonesession daemon (pid: 256528) on host fv-az313-602.
> Feb 14 01:32:08 Stopping zookeeper...
> Feb 14 01:32:08 Stopping zookeeper daemon (pid: 251023) on host fv-az313-602.
> Feb 14 01:32:09 Skipping taskexecutor daemon (pid: 251636), because it is not running anymore on fv-az313-602.
> Feb 14 01:32:09 Skipping taskexecutor daemon (pid: 255483), because it is not running anymore on fv-az313-602.
> Feb 14 01:32:09 Skipping taskexecutor daemon (pid: 258722), because it is not running anymore on fv-az313-602.
> The STDIO streams did not close within 10 seconds of the exit event from process '/usr/bin/bash'. This may indicate a child process inherited the STDIO streams and has not yet exited.
> ##[error]Bash exited with code '1'.
> {code}
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=31347&view=logs&j=e9d3d34f-3d15-59f4-0e3e-35067d100dfe&t=f8a6d3eb-38cf-5cca-9a99-d0badeb5fe62&l=8020
--
This message was sent by Atlassian Jira
(v8.20.1#820001)