[ 
https://issues.apache.org/jira/browse/HADOOP-16440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ayush Saxena moved HDFS-14621 to HADOOP-16440:
----------------------------------------------

    Affects Version/s:     (was: 3.1.2)
                           (was: 2.7.7)
                       2.7.7
                       3.1.2
          Component/s:     (was: distcp)
                  Key: HADOOP-16440  (was: HDFS-14621)
              Project: Hadoop Common  (was: Hadoop HDFS)

> Distcp can not preserve timestamp with -delete  option
> ------------------------------------------------------
>
>                 Key: HADOOP-16440
>                 URL: https://issues.apache.org/jira/browse/HADOOP-16440
>             Project: Hadoop Common
>          Issue Type: Bug
>    Affects Versions: 3.1.2, 2.7.7
>            Reporter: ludun
>            Priority: Major
>         Attachments: HDFS-14261.001.patch, HDFS-14621.002.patch, 
> HDFS-14621.003.patch, HDFS-14621.004.patch
>
>
> Use distcp with  -prbugpcaxt and -delete to copy data between cluster.
> hadoop distcp -Dmapreduce.job.queuename="QueueA" -prbugpcaxt -update -delete  
> hdfs://sourcecluster/user/hive/warehouse/sum.db 
> hdfs://destcluster/user/hive/warehouse/sum.db
> After distcp, we found  the timestamp of dest is different from source, and 
> the timestamp of some directory was the time distcp running.
> Check the code of distcp, in CopyCommitter, it preserves time first then 
> process -delete option which will change the timestamp of dest directory. So 
> we should process -delete option first. 
>  



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to