[ 
https://issues.apache.org/jira/browse/HDFS-7312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14222163#comment-14222163
 ] 

Joseph Prosser commented on HDFS-7312:
--------------------------------------

Added more test cases and a usage note.
Removed an extra line.
This code has been tested on a cluster running Hadoop 2.0.0-cdh4.5.0 with 
access to an S3 bucket.

With the -skiptmp flag set, these log lines no longer appear in the output:
14/11/22 15:20:07 INFO s3a.S3AFileSystem: Delete path s3a://BUCKETNAME/_distcp_tmp_57fmri - recursive true
14/11/22 15:20:07 INFO s3a.S3AFileSystem: Getting path status for s3a://BUCKETNAME/_distcp_tmp_57fmri (_distcp_tmp_57fmri)

> Update DistCp v1 to optionally not use tmp location
> ---------------------------------------------------
>
>                 Key: HDFS-7312
>                 URL: https://issues.apache.org/jira/browse/HDFS-7312
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: tools
>    Affects Versions: 2.5.1
>            Reporter: Joseph Prosser
>            Assignee: Joseph Prosser
>            Priority: Minor
>         Attachments: HDFS-7312.001.patch, HDFS-7312.002.patch, 
> HDFS-7312.003.patch, HDFS-7312.004.patch, HDFS-7312.005.patch, 
> HDFS-7312.006.patch, HDFS-7312.patch
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> DistCp v1 currently copies files to a temporary location and then renames 
> that location to the specified destination.  This can cause performance 
> issues on filesystems such as S3, where a rename is implemented as a copy 
> followed by a delete.  A -skiptmp flag will be added to bypass this step 
> and copy directly to the destination.  This feature mirrors a similar one 
> added to HBase ExportSnapshot 
> [HBASE-11119|https://issues.apache.org/jira/browse/HBASE-11119].
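
The two copy strategies described above can be sketched roughly as follows. 
This is an illustration only, not the code in the attached patches: it uses 
java.nio on a local filesystem rather than the Hadoop FileSystem API, and the 
class name, method names, and tmp directory naming are all hypothetical.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

/** Illustrative sketch only; names are hypothetical and this is not DistCp code. */
final class CopyStrategies {

    /** Stage the file under a tmp directory, then rename it into the destination. */
    static void copyViaTmp(Path src, Path destDir) throws IOException {
        // Hypothetical tmp naming; DistCp chooses its own tmp directory name.
        Path tmpDir = destDir.resolveSibling("_tmp_" + System.nanoTime());
        Files.createDirectories(tmpDir);
        Path staged = tmpDir.resolve(src.getFileName());
        Files.copy(src, staged, StandardCopyOption.REPLACE_EXISTING);

        Files.createDirectories(destDir);
        // The rename step: cheap on a filesystem with a native rename, but on an
        // object store such as S3 it typically costs a second copy plus a delete.
        Files.move(staged, destDir.resolve(src.getFileName()),
                   StandardCopyOption.REPLACE_EXISTING);
        Files.delete(tmpDir); // clean up the now-empty tmp directory
    }

    /** The -skiptmp idea: write straight to the destination, with no rename. */
    static void copyDirect(Path src, Path destDir) throws IOException {
        Files.createDirectories(destDir);
        Files.copy(src, destDir.resolve(src.getFileName()),
                   StandardCopyOption.REPLACE_EXISTING);
    }
}
```

The trade-off in this sketch is that copyDirect saves the rename and the tmp 
cleanup, but a copy that fails partway can leave a partial file at the 
destination, whereas the staged variant leaves it under the tmp directory.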



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
