[ 
https://issues.apache.org/jira/browse/HDFS-9613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Zheng updated HDFS-9613:
----------------------------
    Description: While working on related issue, it was noticed there are some 
places in {{distcp}} that's better to be improved and cleaned up. Particularly, 
after a file is coped to target cluster, it will check the copied file is fine 
or not. For replicated files, when checking, if the source block size and 
checksum option are not preserved while copying, we can avoid comparing the 
file checksums, which may save some time.  (was: While working on related 
issue, it was noticed there are some places in {{distcp}} that's better to be 
improved and cleaned up. Particularly, after a file is coped to target cluster, 
it will check the copied file is fine or not. When checking, better to check 
block size first, then the checksum, because the later is a little expensive.)

> Don't compare file checksums after copy if block size and checksum opt are 
> not preserved for replicated files
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-9613
>                 URL: https://issues.apache.org/jira/browse/HDFS-9613
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Kai Zheng
>            Assignee: Kai Zheng
>            Priority: Minor
>         Attachments: HDFS-9613-v1.patch, HDFS-9613-v2.patch
>
>
> While working on related issue, it was noticed there are some places in 
> {{distcp}} that's better to be improved and cleaned up. Particularly, after a 
> file is coped to target cluster, it will check the copied file is fine or 
> not. For replicated files, when checking, if the source block size and 
> checksum option are not preserved while copying, we can avoid comparing the 
> file checksums, which may save some time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to