[jira] [Commented] (HDFS-7548) Corrupt block reporting delayed until datablock scanner thread detects it

Daryn Sharp (JIRA) Mon, 05 Jan 2015 09:35:59 -0800

    [ 
https://issues.apache.org/jira/browse/HDFS-7548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14264803#comment-14264803
 ]


Daryn Sharp commented on HDFS-7548:
-----------------------------------

The patch appears to mostly duplicate existing code just to prevent 
{{BlockPoolSliceScanner#addBlock}} from updating the {{lastScanTime}}.  Since 
this method is only called in one place, I'd suggest adding a "scanNow" 
boolean.  Update the current caller to pass false, the new try-catch can pass 
true.

Regarding the new try-catch, why catch IOE and re-throw if not 
ChecksumException versus explicitly catching ChecksumException?

> Corrupt block reporting delayed until datablock scanner thread detects it
> -------------------------------------------------------------------------
>
>                 Key: HDFS-7548
>                 URL: https://issues.apache.org/jira/browse/HDFS-7548
>             Project: Hadoop HDFS
>          Issue Type: Bug
>    Affects Versions: 2.5.0
>            Reporter: Rushabh S Shah
>            Assignee: Rushabh S Shah
>         Attachments: HDFS-7548.patch
>
>
> When there is one datanode holding the block and that block happened to be
> corrupt, namenode would keep on trying to replicate the block repeatedly but 
> it would only report the block as corrupt only when the data block scanner 
> thread of the datanode picks up this bad block.
> Requesting improvement in namenode reporting so that corrupt replica would be 
> reported when there is only 1 replica and the replication of that replica 
> keeps on failing with the checksum error.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HDFS-7548) Corrupt block reporting delayed until datablock scanner thread detects it

Reply via email to