Add an option -count to distcp for displaying some info about the src files
---------------------------------------------------------------------------
Key: MAPREDUCE-654
URL: https://issues.apache.org/jira/browse/MAPREDUCE-654
Project: Hadoop Map/Reduce
Issue Type: New Feature
Components: distcp
Affects Versions: 0.21.0
Reporter: Ravi Gummadi
Assignee: Ravi Gummadi
Fix For: 0.21.0
Add an option -count to distcp for displaying metadata about src files like
number of files to be copied and total size of src files to be copied.
WIth -count, distcp doesn't do any copy. Just displays info and exits.
This is useful specifically when used with -update.
distcp -update -count <src>* <dst>
would display the number of files to be updated and the total size of
copy needs to be done(by comparing the file sizes and checksums at src and
dst). Based on this info, users could allocate the number of nodes needed for
the actual update job.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.