[
https://issues.apache.org/jira/browse/FLINK-32681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17749522#comment-17749522
]
Feifan Wang edited comment on FLINK-32681 at 8/1/23 8:03 AM:
-------------------------------------------------------------
Hi [~srichter] , I'm also very interested in this problem. I think it is caused
by the directory being created again by the download task that was already
started. I think it can be solved with a cleaner that is aware of task failure,
and I submitted a draft [PR|https://github.com/apache/flink/pull/23111]
explaining the idea (perhaps as a final fix if you think it is appropriate).
You can take a look if you don't mind. :)
was (Author: feifan wang):
Hi [~srichter] , I'm also very interested in this problem. I think it is caused
by the directory being created again by the download task that was already
started. I think it can be solved with a cleaner that is aware of task failure,
and I submitted a draft PR explaining the idea (perhaps as a final fix if
people feel it is appropriate). You can take a look if you don't mind.
> RocksDBStateDownloaderTest.testMultiThreadCleanupOnFailure unstablie
> --------------------------------------------------------------------
>
> Key: FLINK-32681
> URL: https://issues.apache.org/jira/browse/FLINK-32681
> Project: Flink
> Issue Type: Bug
> Components: Runtime / State Backends, Tests
> Affects Versions: 1.18.0
> Reporter: Chesnay Schepler
> Assignee: Stefan Richter
> Priority: Critical
> Labels: pull-request-available, test-stability
> Fix For: 1.18.0
>
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=51712&view=logs&j=77a9d8e1-d610-59b3-fc2a-4766541e0e33&t=125e07e7-8de0-5c6c-a541-a567415af3ef
> Failed 3 times in yesterdays nightly run.
> {code}
> Jul 26 01:12:46 01:12:46.889 [ERROR]
> org.apache.flink.contrib.streaming.state.RocksDBStateDownloaderTest.testMultiThreadCleanupOnFailure
> Time elapsed: 0.044 s <<< FAILURE!
> Jul 26 01:12:46 java.lang.AssertionError
> Jul 26 01:12:46 at org.junit.Assert.fail(Assert.java:87)
> Jul 26 01:12:46 at org.junit.Assert.assertTrue(Assert.java:42)
> Jul 26 01:12:46 at org.junit.Assert.assertFalse(Assert.java:65)
> Jul 26 01:12:46 at org.junit.Assert.assertFalse(Assert.java:75)
> Jul 26 01:12:46 at
> org.apache.flink.contrib.streaming.state.RocksDBStateDownloaderTest.testMultiThreadCleanupOnFailure(RocksDBStateDownloaderTest.java:151)
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)