I just set up my first Hadoop cluster. My cluster looks like:

Node1 - NameNode + ResourceManager
Node2 - SecondaryNameNode
Node3 - DataNode (+NodeManager)
Node4 - DataNode (+NodeManager)
Node5 - DataNode (+NodeManager)

Running Java's jps command on all machines looks good.
My hadoop-hduser-namenode-node1.log (and the corresponding logs on the other
machines) look fine too. I got a single warning (fs.name.dir has only one
directory). I ran the MapReduce example job wordcount (twice) on a 550 MB
Apache log. The job ran and the output seems fine.

My questions:

- My Web UI at http://namenode:8088/ shows 3 Active Nodes with Node State
Running, but I couldn't see any changes at all while my jobs were running.
Java output after starting the job:
> 16/07/05 13:27:31 INFO mapreduce.Job: The url to track the job: 
> http://localhost:8080/

- Also, the Java output said 'split into 5 blocks'. I'd like to see where the
blocks are stored, to be sure that my cluster is working correctly. Can I
somehow check where the blocks are stored and whether replication works as it
should in theory?

- My Web UI at http://datanode:50075 only shows: 'DataNode on Datanode:50075'.
I cannot click on an overview or get any helpful information there. When I
first tried Hadoop, I used a single-node setup. There, this page had tons of
information about node health, status and jobs, and I was able to browse
HDFS. I'm missing this here, and I don't understand why.
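
Regarding the block question above, this is the rough arithmetic I would
expect to hold, written as a small Python sketch. It assumes the stock HDFS
defaults of a 128 MB block size and a replication factor of 3, which I have
not verified against my own configuration, and the function names are made up
purely for illustration.

```python
import math


def expected_blocks(file_size_mb, block_size_mb=128):
    """Number of HDFS blocks a file of the given size should occupy.

    block_size_mb=128 is an assumption (the Hadoop 2.x default).
    """
    return math.ceil(file_size_mb / block_size_mb)


def expected_replicas(num_blocks, replication=3, datanodes=3):
    """Total block replicas across the cluster.

    replication=3 is an assumption (the HDFS default). Each replica of a
    block lives on a different DataNode, so the per-block replica count
    cannot exceed the number of DataNodes.
    """
    per_block = min(replication, datanodes)
    return num_blocks * per_block


blocks = expected_blocks(550)        # my 550 MB Apache log
print(blocks)                        # 5
print(expected_replicas(blocks))     # 15
```

If those assumptions are right, the 5 blocks match what the Java output
reported, and with only 3 DataNodes I would expect every DataNode to hold a
copy of every block.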

I appreciate any kind of help or advice.
Have a good day everyone.
Best Regards,
Mike.
