[ 
https://issues.apache.org/jira/browse/HADOOP-17256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

liuxiaolong updated HADOOP-17256:
---------------------------------
    Attachment: image-2020-09-10-17-47-01-653.png

> DistCp -update option will be invalid when distcp files from hdfs to S3
> -----------------------------------------------------------------------
>
>                 Key: HADOOP-17256
>                 URL: https://issues.apache.org/jira/browse/HADOOP-17256
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: tools/distcp
>            Reporter: liuxiaolong
>            Priority: Major
>         Attachments: image-2020-09-10-17-25-46-354.png, 
> image-2020-09-10-17-33-50-505.png, image-2020-09-10-17-45-16-998.png, 
> image-2020-09-10-17-47-01-653.png
>
>
> We use distcp with -update option to copy a dir from hdfs to S3. When we run 
> distcp job once more, it will overwrite S3 dir directly, rather than skip the 
> same files.
>  
> Test Case:
> 1. Run twice distcp cmd,  the modify time of S3 files will be modified
> hadoop distcp -update /testA/ s3a://tiered-storage-bigdata-1251625956/testA/
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to