[ 
https://issues.apache.org/jira/browse/MAPREDUCE-654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12757059#action_12757059
 ] 

Hudson commented on MAPREDUCE-654:
----------------------------------

Integrated in Hadoop-Mapreduce-trunk-Commit #49 (See 
[http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk-Commit/49/])
Add a -dryrun option to distcp that prints a summary of the file data to be
copied, without actually performing the copy. Contributed by Ravi Gummadi.


> Add an option -count to distcp for displaying some info about the src files
> ---------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-654
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-654
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: distcp
>    Affects Versions: 0.21.0
>            Reporter: Ravi Gummadi
>            Assignee: Ravi Gummadi
>             Fix For: 0.21.0
>
>         Attachments: d_count.patch, d_count654.patch, d_count_v1.patch, 
> M654-2.patch
>
>
> Add an option -count to distcp for displaying metadata about the src files, 
> such as the number of files to be copied and their total size.
> With -count, distcp doesn't do any copying. It just displays the info and exits.
> This is especially useful when used with -update:
>  distcp -update -count <src>* <dst> 
>       would display the number of files to be updated and the total size of the 
> copy that needs to be done (by comparing the file sizes and checksums at src and 
> dst). Based on this info, users could allocate the number of nodes needed for 
> the actual update job.
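
The comparison described above can be sketched roughly as follows. This is
only an illustration of the idea, not the code in the attached patches:
FileMeta and dry_run_summary are hypothetical names, the checksums are
made-up strings, and the sketch assumes src and dst listings use comparable
relative paths.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Tuple


@dataclass(frozen=True)
class FileMeta:
    """Hypothetical stand-in for one file listing entry (not a Hadoop type)."""
    path: str      # path relative to the src or dst root (assumption)
    size: int      # file length in bytes
    checksum: str  # opaque checksum string (assumption)


def dry_run_summary(src: Iterable[FileMeta],
                    dst: Iterable[FileMeta]) -> Tuple[int, int]:
    """Return (files_to_copy, bytes_to_copy) without copying anything."""
    dst_by_path: Dict[str, FileMeta] = {f.path: f for f in dst}
    files = 0
    total_bytes = 0
    for f in src:
        existing = dst_by_path.get(f.path)
        # A file counts as "to be copied" if it is missing at dst,
        # or if its size or checksum differs from the dst copy.
        if (existing is None
                or existing.size != f.size
                or existing.checksum != f.checksum):
            files += 1
            total_bytes += f.size
    return files, total_bytes


src = [FileMeta("a", 10, "x"), FileMeta("b", 20, "y"), FileMeta("c", 30, "z")]
dst = [FileMeta("a", 10, "x"), FileMeta("b", 20, "changed")]
files, total_bytes = dry_run_summary(src, dst)
print(f"{files} file(s), {total_bytes} byte(s) to copy")
```

Here "a" is skipped because it matches at dst, "b" is counted because its
checksum differs, and "c" is counted because it is missing at dst. The two
totals are what a user would look at when sizing the actual update job.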

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
