Superfast Distcp when copying data within the same hdfs cluster
---------------------------------------------------------------
Key: MAPREDUCE-2117
URL: https://issues.apache.org/jira/browse/MAPREDUCE-2117
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: distcp
Reporter: dhruba borthakur
There are use cases when distcp is used to copy a bunch of files/directories
from one part of the HDFS namespace to another part within the same HDFS
cluster. It is superfast if we can instruct relevant datanodes to make local
replicas of relevant blocks and limit network usage to a minimum. It is
especially useful to make HBase take a backup of a region with minimum
downtime.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.