[ 
https://issues.apache.org/jira/browse/HDFS-8430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15073361#comment-15073361
 ] 

Kai Zheng commented on HDFS-8430:
---------------------------------

[~walter.k.su] even in the normal case, I agree it might not be easy, and we 
need to think about the design. The simple algorithm would be to compute it 
cell by cell as the client reads the file data, but the efficiency would be 
low. I'm wondering if it's possible to adjust the formula so that we can 
aggregate the result block by block within a block group, as replication does, 
and still obtain the same checksum result. Any insight?
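
To make the difficulty concrete, here is a minimal sketch, not the HDFS 
implementation. It assumes a simplified scheme in which each block is hashed 
with MD5 and the per-block digests are hashed again (the real algorithm from 
HADOOP-3981 works on CRC32 chunk checksums, and parity blocks are ignored 
here). The class and method names are hypothetical. It shows that block-by-block 
aggregation over a striped layout differs from the replicated result, while 
reassembling the cells in logical order first reproduces it.

```java
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;

// Hypothetical illustration only; names and layout rules are assumptions.
public class StripeChecksumSketch {

    static MessageDigest newMd5() {
        try {
            return MessageDigest.getInstance("MD5");
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    // Contiguous (replicated) layout: block i holds one consecutive byte range.
    // Assumes file.length is divisible by numBlocks.
    static byte[][] contiguousBlocks(byte[] file, int numBlocks) {
        int blockSize = file.length / numBlocks;
        byte[][] blocks = new byte[numBlocks][];
        for (int i = 0; i < numBlocks; i++) {
            blocks[i] = Arrays.copyOfRange(file, i * blockSize, (i + 1) * blockSize);
        }
        return blocks;
    }

    // Striped layout: cell c is placed round-robin in block (c % numBlocks).
    // Assumes file.length is divisible by numBlocks * cellSize.
    static byte[][] stripedBlocks(byte[] file, int numBlocks, int cellSize) {
        int blockSize = file.length / numBlocks;
        byte[][] blocks = new byte[numBlocks][blockSize];
        int numCells = file.length / cellSize;
        for (int c = 0; c < numCells; c++) {
            int offsetInBlock = (c / numBlocks) * cellSize;
            System.arraycopy(file, c * cellSize, blocks[c % numBlocks], offsetInBlock, cellSize);
        }
        return blocks;
    }

    // Cell-by-cell reassembly: rebuild the logical byte order from striped blocks.
    static byte[] logicalOrder(byte[][] striped, int cellSize) {
        int numBlocks = striped.length;
        int blockSize = striped[0].length;
        byte[] file = new byte[numBlocks * blockSize];
        int numCells = file.length / cellSize;
        for (int c = 0; c < numCells; c++) {
            int offsetInBlock = (c / numBlocks) * cellSize;
            System.arraycopy(striped[c % numBlocks], offsetInBlock, file, c * cellSize, cellSize);
        }
        return file;
    }

    // Block-by-block aggregation: MD5 over the concatenated per-block MD5 digests.
    static byte[] aggregate(byte[][] blocks) {
        MessageDigest outer = newMd5();
        for (byte[] block : blocks) {
            outer.update(newMd5().digest(block));
        }
        return outer.digest();
    }

    public static void main(String[] args) {
        byte[] file = new byte[24];
        for (int i = 0; i < file.length; i++) {
            file[i] = (byte) i;
        }
        byte[] replicated = aggregate(contiguousBlocks(file, 3));
        byte[][] striped = stripedBlocks(file, 3, 2);
        byte[] stripedNaive = aggregate(striped);
        byte[] stripedCellByCell = aggregate(contiguousBlocks(logicalOrder(striped, 2), 3));
        System.out.println("naive striped matches replicated: "
            + Arrays.equals(replicated, stripedNaive));
        System.out.println("cell-by-cell matches replicated: "
            + Arrays.equals(replicated, stripedCellByCell));
    }
}
```

The cell-by-cell path matches because it hashes the same byte sequence as the 
replicated case, but it requires the client to read all the data. The open 
question above is whether the formula can be changed so that per-block results 
can be combined without that reassembly.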

> Erasure coding: update DFSClient.getFileChecksum() logic for stripe files
> -------------------------------------------------------------------------
>
>                 Key: HDFS-8430
>                 URL: https://issues.apache.org/jira/browse/HDFS-8430
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>    Affects Versions: HDFS-7285
>            Reporter: Walter Su
>            Assignee: Walter Su
>
> HADOOP-3981 introduces a distributed file checksum algorithm. It's designed 
> for replicated blocks.
> {{DFSClient.getFileChecksum()}} needs some updates so it can work for striped 
> block groups.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
