I was looking at the web interface and found that some of my nodes have enormous amount of "Non DFS Used".
There is even a node with 800GB of "Non DFS Used" which is just ridiculous. I tried to remove them by doing: "hadoop namenode -format" and I also tried deleting "hadoop.tmp.dir" (in my case, which is /home/hadoop/hadoop_storage/tmp/). But when I start my cluster again, there it is again with thousands of giga bytes of "Non DFS Used". Can anyone tell me what "Non DFS Used" is and how to remove them forever? Thanks in advance.
