Hi Mayuran,

Do you upload all of your data into your Hadoop cluster from node001 and node002?
If so, keep in mind that when the machine doing the upload is itself a datanode in the cluster, one replica of each block is always written to that local node. You should consider running the rebalancer to even out your space usage.

-Todd

On Tue, Aug 11, 2009 at 11:09 AM, Mayuran Yogarajah <[email protected]> wrote:

> I have a 6 node cluster running Hadoop 0.18.3. I'm trying to figure out
> how the data was spread out like this:
>
> node001 94.15%
> node002 94.16%
> node003 48.22%
> node004 47.85%
> node005 48.12%
> node006 43.18%
>
> Node 001 (NN) and node 002 (secondary NN) both got full, while the other
> data nodes had more space left. I had assumed that Hadoop would distribute
> more blocks to nodes 3-6 since they had much more space, but it ended up
> filling up nodes 1 and 2. Is this expected?
>
> thanks,
> M
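To illustrate the placement behavior described in the reply, here is a rough simulation. It is only a sketch under stated assumptions: a replication factor of 3 (the thread does not say what the cluster uses), uploads split evenly between node001 and node002, and the remaining replicas picked uniformly at random from the other nodes. Real HDFS placement also takes racks and free space into account, and `simulate_placement` is a hypothetical helper, not part of Hadoop.

```python
import random
from collections import Counter


def simulate_placement(nodes, writers, num_blocks, replication=3, seed=0):
    """Count replicas per node when the first replica always lands on the writer.

    Assumption: the remaining replicas go to distinct nodes chosen uniformly
    at random from the rest of the cluster (a simplification of HDFS).
    """
    rng = random.Random(seed)
    counts = Counter({node: 0 for node in nodes})
    for _ in range(num_blocks):
        writer = rng.choice(writers)
        counts[writer] += 1  # first replica: the local datanode
        others = [n for n in nodes if n != writer]
        for node in rng.sample(others, replication - 1):
            counts[node] += 1  # remaining replicas: other nodes
    return counts


nodes = [f"node{i:03d}" for i in range(1, 7)]
counts = simulate_placement(nodes, writers=nodes[:2], num_blocks=10_000)
for node in nodes:
    print(node, counts[node])
```

Under these assumptions each uploading node expects 0.5 + 0.5 * (2/5) = 0.7 replicas per block, against 2/5 = 0.4 for every other node, so the two upload nodes fill up noticeably faster, which is the same kind of skew as in the figures above. The rebalancer mentioned in the reply is normally started with `bin/hadoop balancer` (optionally with `-threshold <percent>`); check the documentation for your Hadoop version before relying on that.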
