Re: [PR] HDFS-17280. Pipeline recovery should better end block in advance when bytes acked greater than half of blocksize. [hadoop]

via GitHub Sat, 20 Jan 2024 19:05:39 -0800


hfutatzhanghb commented on PR #6336:
URL: https://github.com/apache/hadoop/pull/6336#issuecomment-1902486702


   > @hfutatzhanghb Thanks for your contribution! Sorry I didn't get this 
proposal clearly. Would you mind to offer some more information about what 
issue do you meet, and what this PR could do? Thanks again.
   
   @Hexiaoqiao Sir, sorry for the late response. Let me describe this PR in 
detail.
   ### 1. The goal of this PR
   Since we have 
[HDFS-16348](https://issues.apache.org/jira/browse/HDFS-16348), we can kick a 
SLOW node out of the pipeline while writing data. If we then call the 
addDatanode2ExistingPipeline() method, it triggers a block transfer to the 
newly added datanode.
   
   Consider the following situation: we have a cluster with a block size of 
512MB. If we have already written 500MB when one datanode is kicked out of the 
pipeline, a new datanode is added to the pipeline and the 500MB already written 
must be transferred to it. This is not efficient.
   
   This PR tries to alleviate that cost: if we have already written more than 
half of the block size, we can end the block in advance and avoid transferring 
the data.
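
   To make the threshold concrete, here is a minimal sketch of the decision 
described above. It is not the code in this PR: the class, enum, and method 
names are hypothetical, and it only models the "more than half of the block 
size" check, not the recovery logic itself.

```java
public class EndBlockEarlySketch {

    /** Hypothetical outcomes when a datanode has been kicked out of the pipeline. */
    enum RecoveryAction { END_BLOCK_EARLY, TRANSFER_TO_NEW_DATANODE }

    /**
     * Chooses an action from the bytes already acked and the configured block size.
     * Assumption: "over a half" is a strict comparison (bytesAcked > blockSize / 2).
     */
    static RecoveryAction chooseAction(long bytesAcked, long blockSize) {
        if (blockSize <= 0 || bytesAcked < 0) {
            throw new IllegalArgumentException("invalid sizes");
        }
        // More than half of the block is already written: finishing the block
        // now avoids copying that data to a replacement datanode.
        return bytesAcked > blockSize / 2
                ? RecoveryAction.END_BLOCK_EARLY
                : RecoveryAction.TRANSFER_TO_NEW_DATANODE;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024L;
        long blockSize = 512 * mb;
        // The example from this comment: 500MB written out of a 512MB block.
        System.out.println(chooseAction(500 * mb, blockSize));
        // A block that has barely started still goes through the normal transfer.
        System.out.println(chooseAction(100 * mb, blockSize));
    }
}
```

   With the 512MB example, 500MB acked is above the 256MB threshold, so the 
sketch picks END_BLOCK_EARLY; at 100MB it falls back to the transfer.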
   


