[ 
https://issues.apache.org/jira/browse/MAPREDUCE-654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12756481#action_12756481
 ] 

Ravi Gummadi commented on MAPREDUCE-654:
----------------------------------------

Yes. Venkatesh is working on a new -dryrun option that will also display the 
files to be copied by distcp. The -count option will be renamed to -dryrun.

> Add an option -count to distcp for displaying some info about the src files
> ---------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-654
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-654
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>          Components: distcp
>    Affects Versions: 0.21.0
>            Reporter: Ravi Gummadi
>            Assignee: Ravi Gummadi
>             Fix For: 0.21.0
>
>         Attachments: d_count.patch, d_count654.patch, d_count_v1.patch
>
>
> Add an option -count to distcp for displaying metadata about the src files, 
> such as the number of files to be copied and their total size.
> With -count, distcp doesn't copy anything; it just displays the info and exits.
> This is useful specifically when used with -update.
>  distcp -update -count <src>* <dst> 
>       would display the number of files to be updated and the total size of 
> the data that needs to be copied (by comparing the file sizes and checksums 
> at src and dst). Based on this info, users could allocate the number of nodes 
> needed for the actual update job.
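
The comparison described above can be illustrated with a minimal sketch. This is 
not distcp's implementation: it assumes plain local directories rather than 
HDFS, uses SHA-256 as a stand-in checksum, and the names file_checksum and 
count_pending are hypothetical. It only shows how a count and total size of 
pending files could be derived from size and checksum comparisons.

```python
import hashlib
import os


def file_checksum(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a local file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def count_pending(src_root, dst_root):
    """Return (file_count, total_bytes) for src files that would need copying.

    A file counts as pending when it is missing at dst, or when its size or
    checksum differs from the copy at dst. Nothing is copied.
    """
    count = 0
    total = 0
    for dirpath, _dirs, files in os.walk(src_root):
        for name in files:
            src = os.path.join(dirpath, name)
            rel = os.path.relpath(src, src_root)
            dst = os.path.join(dst_root, rel)
            size = os.path.getsize(src)
            # Compare sizes first; checksums are only computed when sizes match.
            up_to_date = (
                os.path.isfile(dst)
                and os.path.getsize(dst) == size
                and file_checksum(dst) == file_checksum(src)
            )
            if not up_to_date:
                count += 1
                total += size
    return count, total
```

Calling count_pending("/path/to/src", "/path/to/dst") returns the two numbers a 
user would look at before sizing the actual update job.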

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
