jianghuazhu commented on pull request #3861: URL: https://github.com/apache/hadoop/pull/3861#issuecomment-1039826045
Here are some examples of online clusters. We construct a block device file such as:  This file is non-standard. This kind of file is found when DirectoryScanner is working. log: ` 2022-02-15 11:24:10,286 WARN org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl: Block:1073741828 is not a regular file. ` ` 2022-02-15 11:24:10,286 WARN org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.FsDatasetImpl: Reporting the block blk_1073741828_0 as corrupt due to length mismatch ` Then DataNode will tell NameNode that there are some unqualified blocks through NameNodeRpcServer#reportBadBlocks(). After the NameNode gets the data, it will process it further. After a period of time, the DataNode will automatically clean up these unqualified replica data.  Can you help review this pr again, @jojochuang . Thank you so much. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
