hfutatzhanghb commented on PR #6336: URL: https://github.com/apache/hadoop/pull/6336#issuecomment-1902486702
> @hfutatzhanghb Thanks for your contribution! Sorry I didn't get this proposal clearly. Would you mind to offer some more information about what issue do you meet, and what this PR could do? Thanks again. @Hexiaoqiao Sir, so sorry for repsonsing too late. Let me describe this PR in detail. ### 1.The goal of this PR Since we have [HDFS-16348](https://issues.apache.org/jira/browse/HDFS-16348), we can kick out SLOW node in pipeline when writing data to pipeline. If we call addDatanode2ExistingPipeline() method, it will trigger transfer block process. Think about below situation : we have a cluster with block size equals to 512MB, if we have already written 500MB and one datanode was kicked out from pipeline, then add a new datanode to pipeline. It will transfer 500MB data to the new chosen datanode. This is not efficient. So this PR is trying to alleviate this effect, if we have already written over a half of block size, we can end this block in advance to avoid tranfering data. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
