[
https://issues.apache.org/jira/browse/MAPREDUCE-5899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tsz Wo Nicholas Sze updated MAPREDUCE-5899:
-------------------------------------------
+1 the new patch looks good.
> Support incremental data copy in DistCp
> ---------------------------------------
>
> Key: MAPREDUCE-5899
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-5899
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: distcp
> Reporter: Jing Zhao
> Assignee: Jing Zhao
> Attachments: HADOOP-10608.000.patch, HADOOP-10608.001.patch,
> MAPREDUCE-5899.002.patch
>
>
> Currently when doing distcp with -update option, for two files with the same
> file names but with different file length or checksum, we overwrite the whole
> file. It will be good if we can detect the case where (sourceFile =
> targetFile + appended_data), and only transfer the appended data segment to
> the target. This will be very useful if we're doing incremental distcp.
--
This message was sent by Atlassian JIRA
(v6.2#6252)