[ 
https://issues.apache.org/jira/browse/FLINK-33288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17776081#comment-17776081
 ] 

Xin Chen commented on FLINK-33288:
----------------------------------

[~Zhanghao Chen] Yes, and I'm still a bit confused. As you can see, the 
job-result-store path and the per-application HA storage path are siblings, 
not nested, for example:

{code:java}
/flink/recovery/application_1697525535591_0001
/flink/recovery/job-result-store/application_1697525535591_0001
{code}

And your code:

{code:java}
final Path clusterHaStoragePath =
        HighAvailabilityServicesUtils.getClusterHighAvailableStoragePath(configuration);
{code}
Here 'clusterHaStoragePath' would be 
'/flink/recovery/application_1697525535591_0001', which we can delete. Deleting 
it every time does not seem to affect the job-result-store path 
'/flink/recovery/job-result-store/application_1697525535591_0001', because 
that path is not located under the directory being deleted.
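
To make the layout argument concrete, here is a minimal sketch. It uses plain java.nio.file paths rather than Flink's own Path class, and the class and method names (HaPathLayout, isInside) are made up for illustration. It only checks whether one path is nested under another. It says nothing about what Flink's cleanup logic actually does.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper, not part of Flink: checks directory nesting only.
class HaPathLayout {

    // Returns true if 'candidate' is 'parent' itself or lies somewhere below it.
    static boolean isInside(Path parent, Path candidate) {
        return candidate.normalize().startsWith(parent.normalize());
    }

    public static void main(String[] args) {
        // The two example paths from this comment.
        Path clusterHaStoragePath =
                Paths.get("/flink/recovery/application_1697525535591_0001");
        Path jobResultStorePath =
                Paths.get("/flink/recovery/job-result-store/application_1697525535591_0001");

        // The job-result-store path is a sibling subtree, so a recursive
        // delete of clusterHaStoragePath would not reach it.
        System.out.println(isInside(clusterHaStoragePath, jobResultStorePath));

        // Both paths do share the common root /flink/recovery.
        System.out.println(isInside(Paths.get("/flink/recovery"), jobResultStorePath));
    }
}
```

With these two paths the first check is false and the second is true, which is why I would expect a recursive delete of 'clusterHaStoragePath' to leave the job-result-store directory untouched.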

So why would it break the job result store feature?


> Empty directory residue with appid name in HA(highly-available) related 
> directory of hdfs, not cleaned
> ------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-33288
>                 URL: https://issues.apache.org/jira/browse/FLINK-33288
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Configuration
>    Affects Versions: 1.16.2, 1.17.1
>            Reporter: Xin Chen
>            Priority: Major
>         Attachments: image-2023-10-17-16-43-07-859.png, screenshot-1.png, 
> screenshot-2.png
>
>
> When I submitted a large number of tasks in Flink-on-YARN mode and they 
> executed successfully, I unexpectedly found a large number of empty 
> directories left under the 'high-availability.storageDir' directory on 
> HDFS, named by appid as shown below. I believe these must be cleaned up! 
> However, after verification on 1.16.2 and 1.17.1, neither version cleans 
> them up.
> My flink-conf.yaml setting for 'high-availability.storageDir':
> {code:java}
> high-availability.storageDir: hdfs://hdfsHACluster/flink/recovery
> {code}
>  !screenshot-1.png! 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
