Hi all,

We are running Hadoop 0.20.2 with HBase 0.20.6 on about 30 nodes. Our cluster is under heavy write load, which causes HBase to perform many compactions, which in turn creates many files with many new blocks in HDFS. The problem is that several datanodes periodically run out of disk space; they consume space very rapidly, and we have seen 5 TiB exhausted in a single day.

We investigated the problem and created a patch in FSNamesystem.invalidateWorkForOneNode(). The patch changes how the node to invalidate is selected: in the original version the first node is always chosen, and we changed this to a random node. After applying the patch the problem seems to disappear, and all of our DNs now show constant, balanced disk usage. The question is: is this a general issue, or are we missing something? What is the reason for always taking the first node to invalidate, when this can potentially starve the other nodes?
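
For illustration, here is a minimal sketch of the selection change we mean. This is not the actual FSNamesystem code or our patch; the class and method names are hypothetical stand-ins for the node-selection step inside invalidateWorkForOneNode().

    import java.util.List;
    import java.util.Random;

    public class InvalidationTargetChooser {

        private final Random random = new Random();

        // Original behaviour: always pick the first datanode that has
        // pending block invalidations, which can starve the others.
        String chooseFirst(List<String> nodesWithInvalidateWork) {
            return nodesWithInvalidateWork.isEmpty()
                    ? null
                    : nodesWithInvalidateWork.get(0);
        }

        // Patched behaviour: pick a random datanode so invalidation work
        // (and hence disk reclamation) is spread across the cluster.
        String chooseRandom(List<String> nodesWithInvalidateWork) {
            return nodesWithInvalidateWork.isEmpty()
                    ? null
                    : nodesWithInvalidateWork.get(
                          random.nextInt(nodesWithInvalidateWork.size()));
        }
    }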

Thanks,
 Jan