[
https://issues.apache.org/jira/browse/HADOOP-8143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17102093#comment-17102093
]
Mithun Radhakrishnan commented on HADOOP-8143:
----------------------------------------------
Sorry for the late reply. I am supportive of rolling this back. +1, non-binding.
The workaround I have suggested is unwieldy. And this change was not intended
to mess up non-HDFS DistCp sources/targets.
bq. What made sense back then doesn't make sense now.
Agreed, [~kihwal]. I suspect production DistCp jobs through Oozie DistCp
Actions might already be preserving block-sizes.
Given that HDFS-13056 is in, DistCp should now be free to do CRC checks,
without depending on matching HDFS block sizes.
> Change distcp to have -pb on by default
> ---------------------------------------
>
> Key: HADOOP-8143
> URL: https://issues.apache.org/jira/browse/HADOOP-8143
> Project: Hadoop Common
> Issue Type: Improvement
> Reporter: Dave Thompson
> Assignee: Mithun Radhakrishnan
> Priority: Minor
> Fix For: 3.0.0-alpha4
>
> Attachments: HADOOP-8143.1.patch, HADOOP-8143.2.patch,
> HADOOP-8143.3.patch
>
>
> We should have the preserve blocksize (-pb) on in distcp by default.
> checksum which is on by default will always fail if blocksize is not the same.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]