[ 
https://issues.apache.org/jira/browse/HDFS-16622?focusedWorklogId=778501&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-778501
 ]

ASF GitHub Bot logged work on HDFS-16622:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 06/Jun/22 02:53
            Start Date: 06/Jun/22 02:53
    Worklog Time Spent: 10m 
      Work Description: ZanderXu opened a new pull request, #4407:
URL: https://github.com/apache/hadoop/pull/4407

   JIRA: [HDFS-16622](https://issues.apache.org/jira/browse/HDFS-16622).  
addRDBI in IncrementalBlockReportManager may remove the block with bigger GS.
   I suspect there is a bug in function addRDBI(ReceivedDeletedBlockInfo 
rdbi,DatanodeStorage storage)(line 250).
   Bug code in the for loop:
   synchronized void addRDBI(ReceivedDeletedBlockInfo rdbi,
         DatanodeStorage storage) {
       // Make sure another entry for the same block is first removed.
       // There may only be one such entry.
       for (PerStorageIBR perStorage : pendingIBRs.values()) {
         if (perStorage.remove(rdbi.getBlock()) != null) {
           break;
         }
       }
       getPerStorageIBR(storage).put(rdbi);
     }
   Removed the GS of the Block in ReceivedDeletedBlockInfo may be greater than 
the GS of the Block in rdbi. And NN will invalidate the Replicate will small GS 
when complete one block.




Issue Time Tracking
-------------------

            Worklog Id:     (was: 778501)
    Remaining Estimate: 0h
            Time Spent: 10m

> addRDBI in IncrementalBlockReportManager may remove the block with bigger GS.
> -----------------------------------------------------------------------------
>
>                 Key: HDFS-16622
>                 URL: https://issues.apache.org/jira/browse/HDFS-16622
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: ZanderXu
>            Assignee: ZanderXu
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> In our production environment,  there is a strange missing block, according 
> to the log, I suspect there is a bug in function 
> addRDBI(ReceivedDeletedBlockInfo rdbi,DatanodeStorage storage)(line 250).
> Bug code in the for loop:
> {code:java}
> synchronized void addRDBI(ReceivedDeletedBlockInfo rdbi,
>       DatanodeStorage storage) {
>     // Make sure another entry for the same block is first removed.
>     // There may only be one such entry.
>     for (PerStorageIBR perStorage : pendingIBRs.values()) {
>       if (perStorage.remove(rdbi.getBlock()) != null) {
>         break;
>       }
>     }
>     getPerStorageIBR(storage).put(rdbi);
>   }
> {code}
> Removed the GS of the Block in ReceivedDeletedBlockInfo may be greater than 
> the GS of the Block in rdbi. And NN will invalidate the Replicate will small 
> GS when complete one block. 
> So If there is only one replicate for one block, there is a possibility of 
> missingblock because of this wrong logic. 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to