[ 
https://issues.apache.org/jira/browse/HDFS-15737?focusedWorklogId=526158&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-526158
 ]

ASF GitHub Bot logged work on HDFS-15737:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 18/Dec/20 21:54
            Start Date: 18/Dec/20 21:54
    Worklog Time Spent: 10m 
      Work Description: NickyYe commented on pull request #2562:
URL: https://github.com/apache/hadoop/pull/2562#issuecomment-748338149


   > Thanks for the information - this may explain why HDFS-12703 was needed, 
as some exceptions, which were not logged at that time, caused the decommission 
thread to stop running until the NN was restarted. The change there was to 
catch the exception.
   > 
   > The change here looks correct to me, but as the issue exists on the trunk 
branch, we should fix it there first, and then backport to 3.3, 3.2, 3.1 and 
2.10 so the fix is in place across all branches.
   
   Because of HDFS-14854, the fix on trunk could be quite different: it 
doesn't make sense to add a boolean parameter to stopTrackingNode in the new 
interface when DatanodeAdminBackoffMonitor doesn't need it.
   
   It looks like a better fix would be to introduce a cancelledNodes 
collection in DatanodeAdminDefaultMonitor, just as DatanodeAdminBackoffMonitor 
has. Then stopTrackingNode would not remove dn from outOfServiceNodeBlocks, but 
add it to cancelledNodes for later processing.
   
   However, the change would be a little bit bigger.
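
A minimal sketch of the cancelledNodes idea described above, not the actual patch. String keys and Integer counts stand in for DatanodeDescriptor and the per-node block lists; the class name, isTracked, and the shape of check() are assumptions made for illustration, and locking is omitted.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Queue;
import java.util.TreeMap;

// Hypothetical stand-in for a monitor that defers removals (illustration only).
public class CancelledNodesSketch {
    // Tracked nodes mapped to a pending-block count (simplified state).
    private final TreeMap<String, Integer> outOfServiceNodeBlocks = new TreeMap<>();
    // Nodes whose tracking should stop; drained only outside the scan loop.
    private final Queue<String> cancelledNodes = new ArrayDeque<>();

    void startTrackingNode(String dn, int pendingBlocks) {
        outOfServiceNodeBlocks.put(dn, pendingBlocks);
    }

    // Only records the request; the map is never modified here.
    void stopTrackingNode(String dn) {
        cancelledNodes.add(dn);
    }

    // Applies the queued removals while no iterator over the map is live.
    private void processCancelledNodes() {
        String dn;
        while ((dn = cancelledNodes.poll()) != null) {
            outOfServiceNodeBlocks.remove(dn);
        }
    }

    // One scan pass; returns the nodes that were visited.
    List<String> check() {
        processCancelledNodes(); // drop nodes cancelled since the last pass
        List<String> visited = new ArrayList<>();
        for (Map.Entry<String, Integer> e : outOfServiceNodeBlocks.entrySet()) {
            visited.add(e.getKey());
            if (e.getValue() == 0) {
                // Safe during iteration: this only appends to cancelledNodes.
                stopTrackingNode(e.getKey());
            }
        }
        processCancelledNodes(); // apply removals requested during this pass
        return visited;
    }

    boolean isTracked(String dn) {
        return outOfServiceNodeBlocks.containsKey(dn);
    }
}
```

In this sketch a call to stopTrackingNode made while check() is iterating leaves the map untouched, so the iterator is never invalidated; the node disappears from the map once the pass finishes.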


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 526158)
    Time Spent: 1.5h  (was: 1h 20m)

> Don't remove datanodes from outOfServiceNodeBlocks while checking in 
> DatanodeAdminManager
> -----------------------------------------------------------------------------------------
>
>                 Key: HDFS-15737
>                 URL: https://issues.apache.org/jira/browse/HDFS-15737
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: namenode
>            Reporter: Ye Ni
>            Assignee: Ye Ni
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 2.10.2
>
>          Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> With CyclicIteration, removing an item while iterating causes either an 
> infinite loop or a ConcurrentModificationException.
> The item should instead be queued for removal with
> {{toRemove.add(dn);}}
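
The deferred-removal pattern named in the description can be sketched roughly as follows. This is an illustration only: a plain TreeMap with String keys replaces the real outOfServiceNodeBlocks map and its CyclicIteration wrapper, and the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration of collecting removals during a scan.
public class DeferredRemovalSketch {
    // Removes every node with no pending blocks, mutating the map only
    // after the iteration has finished.
    static List<String> pruneFinished(TreeMap<String, Integer> outOfServiceNodeBlocks) {
        List<String> toRemove = new ArrayList<>();
        for (Map.Entry<String, Integer> e : outOfServiceNodeBlocks.entrySet()) {
            if (e.getValue() == 0) {
                // Calling outOfServiceNodeBlocks.remove(...) here would modify
                // the map under a live iterator; queue the key instead.
                toRemove.add(e.getKey());
            }
        }
        for (String dn : toRemove) {
            outOfServiceNodeBlocks.remove(dn);
        }
        return toRemove;
    }

    public static void main(String[] args) {
        TreeMap<String, Integer> nodes = new TreeMap<>();
        nodes.put("dn1", 0);
        nodes.put("dn2", 5);
        nodes.put("dn3", 0);
        List<String> removed = pruneFinished(nodes);
        // prints "[dn1, dn3] [dn2]"
        System.out.println(removed + " " + nodes.keySet());
    }
}
```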



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
