[ https://issues.apache.org/jira/browse/HDFS-4288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13554727#comment-13554727 ]

Todd Lipcon commented on HDFS-4288:
-----------------------------------

This now seems to be failing on precommit builds even though it passed its own 
precommit build above:
java.lang.AssertionError: Number of blocks is inconcistent with the array length
        at org.apache.hadoop.hdfs.protocol.BlockListAsLongs.getNumberOfBlocks(BlockListAsLongs.java:204)
        at org.apache.hadoop.hdfs.protocol.BlockListAsLongs$BlockReportIterator.hasNext(BlockListAsLongs.java:153)
        at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.processFirstBlockReport(BlockManager.java:1672)
        at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.processReport(BlockManager.java:1558)
        at org.apache.hadoop.hdfs.server.blockmanagement.TestBlockManager.testSafeModeIBR(TestBlockManager.java:511)

I'm able to reproduce this locally as well. I'm not sure what changed on trunk 
between the precommit run last week and now, but something seems to be causing 
issues. I'll revert this for now.
                
> NN accepts incremental BR as IBR in safemode
> --------------------------------------------
>
>                 Key: HDFS-4288
>                 URL: https://issues.apache.org/jira/browse/HDFS-4288
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: namenode
>    Affects Versions: 0.23.0, 2.0.0-alpha, 3.0.0
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>            Priority: Critical
>             Fix For: 3.0.0, 2.0.3-alpha
>
>         Attachments: HDFS-4288.branch-23.patch, HDFS-4288.patch
>
>
> If a DN is ready to send an incremental BR and the NN goes down, the DN will 
> repeatedly try to reconnect.  The NN will then process the DN's incremental 
> BR as an initial BR.  The NN now thinks the DN has only a few blocks, and 
> will ignore all subsequent BRs from that DN until out of safemode -- which it 
> may never do because of all the "missing" blocks on the affected DNs.
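
The failure mode described above can be sketched with a toy model. This is
only an illustration under stated assumptions: the class, enum, and method
names below are hypothetical and are not Hadoop's BlockManager API, and the
guard shown is just one plausible way to avoid the problem, not necessarily
what the attached patch does.

```java
import java.util.HashMap;
import java.util.Map;

// Toy model of a NameNode in safemode. All names here are hypothetical;
// this is not Hadoop code.
final class SafeModeReportSketch {
    enum ReportType { FULL, INCREMENTAL }

    // Assumed guard: refuse to treat an incremental report as the initial one.
    private final boolean rejectIncrementalAsFirst;

    // Block count recorded per datanode once its "first" report is accepted.
    private final Map<String, Integer> blocksKnown = new HashMap<>();

    SafeModeReportSketch(boolean rejectIncrementalAsFirst) {
        this.rejectIncrementalAsFirst = rejectIncrementalAsFirst;
    }

    /** Returns true if the report was applied, false if it was ignored. */
    boolean processReport(String datanode, ReportType type, int blockCount) {
        boolean isFirst = !blocksKnown.containsKey(datanode);
        if (!isFirst) {
            // While in safemode, later reports from a known datanode are ignored.
            return false;
        }
        if (type == ReportType.INCREMENTAL && rejectIncrementalAsFirst) {
            // With the guard, keep waiting for a full report.
            return false;
        }
        blocksKnown.put(datanode, blockCount);
        return true;
    }

    int blocksKnown(String datanode) {
        return blocksKnown.getOrDefault(datanode, 0);
    }

    public static void main(String[] args) {
        // Without the guard, a small incremental report masks the full one.
        SafeModeReportSketch buggy = new SafeModeReportSketch(false);
        buggy.processReport("dn1", ReportType.INCREMENTAL, 3);
        buggy.processReport("dn1", ReportType.FULL, 1000);
        System.out.println("without guard: " + buggy.blocksKnown("dn1"));

        // With the guard, the incremental report is skipped and the full one wins.
        SafeModeReportSketch guarded = new SafeModeReportSketch(true);
        guarded.processReport("dn1", ReportType.INCREMENTAL, 3);
        guarded.processReport("dn1", ReportType.FULL, 1000);
        System.out.println("with guard: " + guarded.blocksKnown("dn1"));
    }
}
```

In this model the unguarded instance ends up believing dn1 holds only the
few blocks from the incremental report, which mirrors the "missing" blocks
that can keep the NN in safemode.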

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
