distcp -update launches job when there is at least one dir in source paths to be copied, even though there is nothing to copy
-----------------------------------------------------------------------------------------------------------------------------
Key: HADOOP-6053
URL: https://issues.apache.org/jira/browse/HADOOP-6053
Project: Hadoop Core
Issue Type: Improvement
Components: tools/distcp
Affects Versions: 0.21.0
Reporter: Ravi Gummadi
Priority: Minor
Fix For: 0.21.0

distcp -update launches a job when there is at least one dir in the source paths to be copied, even though there is nothing to copy.

HADOOP-5675 added a check for fileCount > 0 to decide whether to launch the job. HADOOP-5762 then changed this to fileCount + dirCount > 0 to solve the issue of empty directories not getting copied to the destination.

With -update, dirCount is incremented without checking whether that dir already exists at the destination. So the distcp job is launched because dirCount > 0, even though there is nothing to copy.

With -update, incrementing dirCount can be skipped if that dir already exists at the destination.
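
The proposed change can be illustrated with a minimal sketch. This is not the distcp source: the class UpdateCountSketch, the method countWork, and the Counts holder are hypothetical names, and the sketch walks a local directory tree with java.nio.file instead of the Hadoop FileSystem API. The rule for skipping files (same size at the destination) is also a simplifying assumption. Only the launch condition fileCount + dirCount > 0 and the idea of skipping the dirCount increment for directories that already exist come from the description above.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.stream.Stream;

public class UpdateCountSketch {

    /** Hypothetical holder for the number of items that still need copying. */
    static final class Counts {
        long fileCount;
        long dirCount;

        /** Launch condition as described in the issue (after HADOOP-5762). */
        boolean shouldLaunchJob() {
            return fileCount + dirCount > 0;
        }
    }

    /**
     * Walks srcRoot and counts what would have to be copied to dstRoot.
     * With update == true, a directory that already exists at the
     * destination is not counted, which is the change proposed here.
     */
    static Counts countWork(Path srcRoot, Path dstRoot, boolean update) throws IOException {
        Counts counts = new Counts();
        try (Stream<Path> walk = Files.walk(srcRoot)) {
            for (Path src : (Iterable<Path>) walk::iterator) {
                // Map the source path onto the destination tree.
                Path dst = dstRoot.resolve(srcRoot.relativize(src).toString());
                if (Files.isDirectory(src)) {
                    if (update && Files.isDirectory(dst)) {
                        continue; // dir already at destination: nothing to do
                    }
                    counts.dirCount++;
                } else {
                    // Assumption: a same-sized file at the destination is up to date.
                    if (update && Files.exists(dst) && Files.size(dst) == Files.size(src)) {
                        continue;
                    }
                    counts.fileCount++;
                }
            }
        }
        return counts;
    }
}
```

Under these assumptions, calling countWork(src, dst, true) on two trees that already match yields zero for both counters, so shouldLaunchJob() returns false and no job would be started.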