[
https://issues.apache.org/jira/browse/HDFS-11797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16047116#comment-16047116
]
Yongjun Zhang commented on HDFS-11797:
--------------------------------------
Thanks you all for looking into this issue.
Hi [~kshukla], thanks for reporting and working the issue, I assume the release
you are running doesn't have HDFS-11445 fix.
My understanding of HDFS-11445 is, when we tried to remove a corrupt replica,
we only removed it from blockMap, and we "forgot" to remove it from the
corruptReplicaMap, thus caused the inconsistency.
Hi [~daryn], if my understanding is correct here, the fix you mentioned at
https://issues.apache.org/jira/browse/HDFS-11797?focusedCommentId=16042960&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-16042960
could be a follow-up jira. Do you agree?
Thanks.
> BlockManager#createLocatedBlocks() can throw ArrayIndexOutofBoundsException
> when corrupt replicas are inconsistent
> ------------------------------------------------------------------------------------------------------------------
>
> Key: HDFS-11797
> URL: https://issues.apache.org/jira/browse/HDFS-11797
> Project: Hadoop HDFS
> Issue Type: Bug
> Reporter: Kuhu Shukla
> Assignee: Kuhu Shukla
> Priority: Critical
> Attachments: HDFS-11797.001.patch
>
>
> The calculation for {{numMachines}} can be too less (causing
> ArrayIndexOutOfBoundsException) or too many (causing NPE (HDFS-9958)) if data
> structures find inconsistent number of corrupt replicas. This was earlier
> found related to failed storages. This JIRA tracks a change that works for
> all possible cases of inconsistencies.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]