[
https://issues.apache.org/jira/browse/HDFS-8430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15073361#comment-15073361
]
Kai Zheng commented on HDFS-8430:
---------------------------------
[~walter.k.su] even in the normal case, I agree it might not be easy and need
to think about the design. The simple algorithm would be to do it cell by cell
as client reads the file data but the efficiency would be low. I'm wondering if
it's possible to adjust the formula so that we can aggregate the result block
by block in a block group as replication does, obtaining the same checksum
result. Any insight?
> Erasure coding: update DFSClient.getFileChecksum() logic for stripe files
> -------------------------------------------------------------------------
>
> Key: HDFS-8430
> URL: https://issues.apache.org/jira/browse/HDFS-8430
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Affects Versions: HDFS-7285
> Reporter: Walter Su
> Assignee: Walter Su
>
> HADOOP-3981 introduces a distributed file checksum algorithm. It's designed
> for replicated block.
> {{DFSClient.getFileChecksum()}} need some updates, so it can work for striped
> block group.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)