[ 
https://issues.apache.org/jira/browse/HDFS-11797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16043089#comment-16043089
 ] 

Wei-Chiu Chuang commented on HDFS-11797:
----------------------------------------

Thanks Kuhu and Jing.

Quickly reviewed the patch, it does look like it fixed the same issue. If I 
understand the fix correctly, there are certain cases where block manager fails 
to remove stale replicas when block replica information is updated, causing 
inconsistency of corrupt replica count between block manager and other data 
structures. However, I am not sure if there are other (unfixed) cases where 
this happens as well.

[~brahmareddy] would you mind to take a look?

> BlockManager#createLocatedBlocks() can throw ArrayIndexOutofBoundsException 
> when corrupt replicas are inconsistent
> ------------------------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-11797
>                 URL: https://issues.apache.org/jira/browse/HDFS-11797
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Kuhu Shukla
>            Assignee: Kuhu Shukla
>            Priority: Critical
>         Attachments: HDFS-11797.001.patch
>
>
> The calculation for {{numMachines}} can be too less (causing 
> ArrayIndexOutOfBoundsException) or too many (causing NPE (HDFS-9958)) if data 
> structures find inconsistent number of corrupt replicas. This was earlier 
> found related to failed storages. This JIRA tracks a change that works for 
> all possible cases of inconsistencies.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to