[ 
https://issues.apache.org/jira/browse/HDFS-7354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14382400#comment-14382400
 ] 

Zhe Zhang commented on HDFS-7354:
---------------------------------

Thanks Nicholas for the suggestion.

Under HDFS-7369 I [proposed | 
https://issues.apache.org/jira/browse/HDFS-7369?focusedCommentId=14318849&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14318849]
 a first order approximation to estimate the risk level of striped blocks:
# A block group with _m_ data blocks and _k_ parity blocks is equivalent to a 
contiguous block with replication factor _k+1_, because both can tolerate _k_ 
failures
#If _n_ blocks are healthy among the _m+k_ blocks, it's equivalent to having 
_n-m+1_ healthy replicas, calculated from _(k+1) - ((m+k) - n)_, or 
_replicationFactor - numLostReplicas_.

I believe [~jingzhao] is working on a patch to detect at-risk striped blocks 
and add them to {{UnderReplicatedBlocks}}. This JIRA aims to further 
differentiate parity blocks. Assuming a block group has lost 1 block. If that 
missing block is parity data, the priority should be even lower than missing a 
raw data block, because client will suffer from longer latency if a raw data 
block is missing.

> Support parity blocks in block management
> -----------------------------------------
>
>                 Key: HDFS-7354
>                 URL: https://issues.apache.org/jira/browse/HDFS-7354
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Zhe Zhang
>            Assignee: Zhe Zhang
>
> Parity blocks are not accessed during normal I/O operations. They should 
> therefore be treated with lower priority in the block recovery framework. 
> This JIRA tracks this effort as well as other special treatments which might 
> be needed for parity blocks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to