[ 
https://issues.apache.org/jira/browse/HADOOP-16440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ayush Saxena updated HADOOP-16440:
----------------------------------
       Resolution: Fixed
     Hadoop Flags: Reviewed
    Fix Version/s: 3.1.3
                   3.2.1
                   3.3.0
           Status: Resolved  (was: Patch Available)

> Distcp can not preserve timestamp with -delete  option
> ------------------------------------------------------
>
>                 Key: HADOOP-16440
>                 URL: https://issues.apache.org/jira/browse/HADOOP-16440
>             Project: Hadoop Common
>          Issue Type: Bug
>    Affects Versions: 2.7.7, 3.1.2
>            Reporter: ludun
>            Assignee: ludun
>            Priority: Major
>             Fix For: 3.3.0, 3.2.1, 3.1.3
>
>         Attachments: HDFS-14261.001.patch, HDFS-14621.002.patch, 
> HDFS-14621.003.patch, HDFS-14621.004.patch
>
>
> Use distcp with -prbugpcaxt and -delete to copy data between clusters:
> hadoop distcp -Dmapreduce.job.queuename="QueueA" -prbugpcaxt -update -delete  
> hdfs://sourcecluster/user/hive/warehouse/sum.db 
> hdfs://destcluster/user/hive/warehouse/sum.db
> After the copy, the timestamps on the destination differ from those on the 
> source, and some directories carry the time at which distcp ran.
> In the distcp code, CopyCommitter preserves the timestamps first and then 
> processes the -delete option, which changes the timestamps of the destination 
> directories. The -delete option should therefore be processed first. 
>  
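
The ordering problem described above can be reproduced on a local filesystem. The sketch below is not the CopyCommitter code; it is a minimal stand-in that assumes the local filesystem, like the destination cluster in the report, updates a directory's modification time when an entry in it is removed. The names SOURCE_MTIME, make_target, preserve_then_delete and delete_then_preserve are hypothetical and exist only for this illustration.

```python
import os
import tempfile

# Arbitrary fixed timestamp standing in for the source directory's mtime (assumption).
SOURCE_MTIME = 1_000_000_000


def make_target(root):
    """Create a destination directory holding one stale file that a delete pass would remove."""
    target = os.path.join(root, "dest")
    os.mkdir(target)
    stale = os.path.join(target, "stale.txt")
    with open(stale, "w") as f:
        f.write("not present on the source\n")
    return target, stale


def preserve_then_delete(root):
    """Order described in the report: set the timestamp, then delete."""
    target, stale = make_target(root)
    os.utime(target, (SOURCE_MTIME, SOURCE_MTIME))  # "preserve" step
    os.remove(stale)                                # "delete" step touches the directory
    return int(os.stat(target).st_mtime)


def delete_then_preserve(root):
    """Proposed order: delete first, then set the timestamp."""
    target, stale = make_target(root)
    os.remove(stale)                                # "delete" step
    os.utime(target, (SOURCE_MTIME, SOURCE_MTIME))  # "preserve" step runs last
    return int(os.stat(target).st_mtime)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as a, tempfile.TemporaryDirectory() as b:
        print("preserve then delete kept mtime:", preserve_then_delete(a) == SOURCE_MTIME)
        print("delete then preserve kept mtime:", delete_then_preserve(b) == SOURCE_MTIME)
```

With the first ordering the directory ends up with the time of the deletion; with the second it keeps the value that was set last, which is the behaviour the report asks for.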



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)
