Hi All,
Surendra and I have lately been trying to minimise the impact of block reports
on the Namenode in large clusters. We observed that in a large cluster, of
about 10k datanodes, the periodic block reports adversely affect Namenode
performance.
We have been thinking of restricting block reports to be triggered only during
Namenode startup or in case of failover, and eliminating the periodic block
report.
The main purpose of the block report is to get corrupt blocks recognised, so as
a follow-up we can maintain a service at the datanode that runs periodically to
check whether the block size in memory is the same as that reported to the
Namenode, and the datanode can alert the Namenode if anything looks suspect.
(We still need to plan this.)
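
To make the follow-up check concrete, here is a rough sketch in Java of the
comparison such a datanode service might run. It is only an illustration under
assumptions: BlockLengthChecker and findSuspectBlocks are hypothetical names,
not existing Hadoop classes, and the two maps stand in for whatever the
datanode actually tracks (block id to length in memory, and block id to the
length last reported to the Namenode).

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;

/** Hypothetical helper for illustration only; not part of Hadoop. */
final class BlockLengthChecker {

    /**
     * Returns the ids of blocks whose in-memory length differs from the
     * length last reported to the Namenode, or which are missing in memory.
     * Both maps are assumed to be keyed by block id.
     */
    static List<Long> findSuspectBlocks(Map<Long, Long> inMemoryLengths,
                                        Map<Long, Long> reportedLengths) {
        List<Long> suspects = new ArrayList<>();
        for (Map.Entry<Long, Long> reported : reportedLengths.entrySet()) {
            Long actual = inMemoryLengths.get(reported.getKey());
            // A missing block or a length mismatch both count as suspect.
            if (actual == null || !actual.equals(reported.getValue())) {
                suspects.add(reported.getKey());
            }
        }
        // Sort so the result is deterministic regardless of map ordering.
        Collections.sort(suspects);
        return suspects;
    }
}
```

A periodic task on the datanode could call findSuspectBlocks and send only the
returned ids to the Namenode, rather than a full block report. How the alert
itself is delivered is left open here, as it is in the proposal.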

On the datanode side, a datanode can still send a block report, or restore its
regular frequency, if during the configured time period the datanode was shut
down or lost its connection with the Namenode. For example, if the datanode is
supposed to send a BR at 2100 hrs and there has been any failover or loss of
connection between the Namenode and the datanode during the last 6 hrs, it
will trigger the BR as normal; otherwise it shall skip sending the BR.
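
The skip rule above could be sketched roughly as follows in Java. This is a
minimal illustration, not an existing Hadoop API: PeriodicBlockReportGate,
recordDisruption and shouldSend are hypothetical names, and "disruption" here
is assumed to cover a failover, a datanode restart, or a lost connection to
the Namenode.

```java
import java.time.Duration;
import java.time.Instant;

/** Hypothetical helper for illustration only; not part of Hadoop. */
final class PeriodicBlockReportGate {
    private final Duration window;   // the configured period, e.g. 6 hours
    private Instant lastDisruption;  // null until a disruption is recorded

    PeriodicBlockReportGate(Duration window) {
        this.window = window;
    }

    /** Record a failover, restart, or loss of connection to the Namenode. */
    synchronized void recordDisruption(Instant when) {
        if (lastDisruption == null || when.isAfter(lastDisruption)) {
            lastDisruption = when;
        }
    }

    /**
     * Decide whether the block report scheduled at scheduledTime should be
     * sent: only if a disruption happened within the preceding window.
     */
    synchronized boolean shouldSend(Instant scheduledTime) {
        if (lastDisruption == null) {
            return false; // nothing happened, skip the periodic BR
        }
        Instant windowStart = scheduledTime.minus(window);
        return !lastDisruption.isBefore(windowStart);
    }
}
```

With a 6 hr window and a BR scheduled at 2100 hrs, a connection loss recorded
at 1800 hrs makes shouldSend return true, while the next scheduled BR, 6 hrs
later, is skipped if nothing else has happened in between.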

Let us know your thoughts, challenges, or suggested improvements on this.

-Ayush


