ludun created HDFS-14621:
----------------------------
Summary: Distcp can not preserve timestamp with -delete option
Key: HDFS-14621
URL: https://issues.apache.org/jira/browse/HDFS-14621
Project: Hadoop HDFS
Issue Type: Bug
Components: distcp
Affects Versions: 3.1.2, 2.7.7
Reporter: ludun
We use distcp with -prbugpcaxt and -delete to copy data between clusters:
hadoop distcp -Dmapreduce.job.queuename="QueueA" -prbugpcaxt -update -delete
hdfs://sourcecluster/user/hive/warehouse/sum.db
hdfs://destcluster/user/hive/warehouse/sum.db
After distcp finished, we found that the timestamps on the destination differed
from those on the source, and that some directories carried the time at which
distcp ran.
Looking at the distcp code, the committer preserves timestamps first and only
then processes the -delete option, which changes the timestamps of the
destination directories. The -delete option should therefore be processed first.
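The ordering problem can be reproduced in miniature on a local filesystem. The
sketch below is not the distcp committer code; it only assumes that deleting a
child updates the parent directory's modification time, as the report describes
for the destination cluster. The class name, method name, and SOURCE_MTIME
value are hypothetical and exist only for illustration.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.attribute.FileTime;

final class CommitOrderDemo {
    // Hypothetical source-side modification time that should be preserved.
    static final FileTime SOURCE_MTIME = FileTime.fromMillis(1_000_000_000_000L);

    /**
     * Creates a target directory holding one stale file, then applies the two
     * commit steps in the requested order. Returns true if the directory still
     * carries SOURCE_MTIME afterwards.
     */
    static boolean timestampSurvives(boolean deleteFirst) {
        try {
            Path dir = Files.createTempDirectory("distcp-order-demo");
            Path stale = Files.createFile(dir.resolve("stale.txt"));

            if (deleteFirst) {
                // Proposed order: remove extra files, then restore the timestamp.
                Files.delete(stale);
                Files.setLastModifiedTime(dir, SOURCE_MTIME);
            } else {
                // Reported order: restore the timestamp, then remove extra files.
                Files.setLastModifiedTime(dir, SOURCE_MTIME);
                Files.delete(stale); // the delete modifies the directory again
            }

            long actual = Files.getLastModifiedTime(dir).toMillis();
            Files.delete(dir); // clean up the temporary directory
            return actual == SOURCE_MTIME.toMillis();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("preserve, then delete: timestamp kept = "
                + timestampSurvives(false));
        System.out.println("delete, then preserve: timestamp kept = "
                + timestampSurvives(true));
    }
}
```

On filesystems that update a directory's modification time when an entry is
removed, only the delete-first order should leave the preserved timestamp
intact, which is the behaviour this issue asks for.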