[ 
https://issues.apache.org/jira/browse/HDFS-7877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16977333#comment-16977333
 ] 

zy.jordan commented on HDFS-7877:
---------------------------------

And I find this log
{quote}2019-11-18 18:50:24,707 INFO [DecommissionMonitor-0] BlockStateChange: 
BLOCK* InvalidateBlocks: add blk_51421301649_50350135968 to xxxxxip1:50010
2019-11-18 18:50:24,707 WARN [DecommissionMonitor-0] 
org.apache.hadoop.hdfs.server.blockmanagement.DecommissionManager: 
DatanodeAdminMonitor caught exception when processing node xxxxxxip1:50010.
java.lang.NullPointerException
 at 
org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.processExtraRedundancyBlocksOnInService(BlockManager.java:3550)
 at 
org.apache.hadoop.hdfs.server.blockmanagement.DecommissionManager.stopMaintenance(DecommissionManager.java:313)
 at 
org.apache.hadoop.hdfs.server.blockmanagement.DecommissionManager$Monitor.check(DecommissionManager.java:538)
 at 
org.apache.hadoop.hdfs.server.blockmanagement.DecommissionManager$Monitor.run(DecommissionManager.java:495)
 at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
 at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)
 at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)
 at 
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)
 at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
 at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
 at java.lang.Thread.run(Thread.java:745)
2019-11-18 18:50:24,708 INFO [DecommissionMonitor-0] 
org.apache.hadoop.hdfs.server.blockmanagement.HeartbeatManager: Stopping 
maintenance of live node xxxxxxxip1:50010
{quote}
 
{quote}void processExtraRedundancyBlocksOnInService(
    final DatanodeDescriptor srcNode) {
    if (!namesystem.isPopulatingReplQueues()) {
        return;
    }
    final Iterator<BlockInfo> it = srcNode.getBlockIterator();
    int numOverReplicated = 0;
    while(it.hasNext()) {
        final BlockInfo block = it.next();
        BlockCollection bc = blocksMap.getBlockCollection(block);
        short expectedReplication = bc.getBlockReplication(); //this line is 
BlockManager.java:3550
        NumberReplicas num = countNodes(block);
        int numCurrentReplica = num.liveReplicas();
        if (numCurrentReplica > expectedReplication) {
            // over-replicated block 
             processOverReplicatedBlock(block, expectedReplication, null, null);
             numOverReplicated++;
         }
     }
     LOG.info("Invalidated " + numOverReplicated + " over-replicated blocks on 
" +
     srcNode + " during recommissioning");
}{quote}

> [Umbrella] Support maintenance state for datanodes
> --------------------------------------------------
>
>                 Key: HDFS-7877
>                 URL: https://issues.apache.org/jira/browse/HDFS-7877
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>          Components: datanode, namenode
>            Reporter: Ming Ma
>            Assignee: Ming Ma
>            Priority: Major
>             Fix For: 2.9.0, 3.0.0-beta1, 3.1.0
>
>         Attachments: HDFS-7877-2.patch, HDFS-7877.patch, 
> Supportmaintenancestatefordatanodes-2.pdf, 
> Supportmaintenancestatefordatanodes.pdf
>
>
> This requirement came up during the design for HDFS-7541. Given this feature 
> is mostly independent of upgrade domain feature, it is better to track it 
> under a separate jira. The design and draft patch will be available soon.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to