[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jing Zhao updated MAPREDUCE-5899:
---------------------------------

       Resolution: Fixed
    Fix Version/s: 2.5.0
           Status: Resolved  (was: Patch Available)

I've committed this to trunk and branch-2.

> Support incremental data copy in DistCp
> ---------------------------------------
>
>                 Key: MAPREDUCE-5899
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5899
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: distcp
>            Reporter: Jing Zhao
>            Assignee: Jing Zhao
>             Fix For: 2.5.0
>
>         Attachments: HADOOP-10608.000.patch, HADOOP-10608.001.patch, 
> MAPREDUCE-5899.002.patch, MAPREDUCE-5899.002.patch
>
>
> Currently when doing distcp with -update option, for two files with the same 
> file names but with different file length or checksum, we overwrite the whole 
> file. It will be good if we can detect the case where (sourceFile = 
> targetFile + appended_data), and only transfer the appended data segment to 
> the target. This will be very useful if we're doing incremental distcp.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to