[ 
https://issues.apache.org/jira/browse/HDFS-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12999099#comment-12999099
 ] 

Tsz Wo (Nicholas), SZE commented on HDFS-219:
---------------------------------------------

> I think it would be usefull to add md5sum ...

Computing a single md5 over a file (i.e. md5sum) is inefficient when the file 
is large since the md5 algorithm cannot be executed in parallel.  This is also 
the reason that we have {{MD5MD5CRC32FileChecksum}} in HDFS.

> Add md5sum facility in dfsshell
> -------------------------------
>
>                 Key: HDFS-219
>                 URL: https://issues.apache.org/jira/browse/HDFS-219
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>            Reporter: zhangwei
>
> I think it would be usefull to add md5sum (or anyone else) to dfsshell ,and 
> the facility can verify the file on hdfs.It can confirm the file is integrity 
> after copyFromLocal or copyToLocal.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to