[
https://issues.apache.org/jira/browse/HADOOP-18582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
10000kang updated HADOOP-18582:
-------------------------------
Description:
it not necessary to do `cleanupTempFiles` while ditcp commit job in direct
mode, because it there is no temp files in direct mode.
This clean operation will increase the task execution time, because it will get
the list of files in the target path. When the number of files in the target
path is very large, this operation will be very slow.
was:
it not necessary to do `cleanupTempFiles` while ditcp commit job in direct ,
because it there is no temp files in direct mode.
This clean operation will increase the task execution time, because it will get
the list of files in the target path. When the number of files in the target
path is very large, this operation will be very slow.
> No need to clean tmp files in ditcp direct mode
> -----------------------------------------------
>
> Key: HADOOP-18582
> URL: https://issues.apache.org/jira/browse/HADOOP-18582
> Project: Hadoop Common
> Issue Type: Bug
> Components: tools/distcp
> Affects Versions: 3.3.4
> Reporter: 10000kang
> Priority: Critical
>
> it not necessary to do `cleanupTempFiles` while ditcp commit job in direct
> mode, because it there is no temp files in direct mode.
> This clean operation will increase the task execution time, because it will
> get the list of files in the target path. When the number of files in the
> target path is very large, this operation will be very slow.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]