Re: what is the relation between the classes at the very beginning?

Steve Loughran Mon, 16 Mar 2009 04:09:54 -0700

Alex Zheng wrote:

I am a newer for Hadoop, and am reading the code of Hadoop for a week
now i am very puzzled by the relation of so many classes after i run :
bin/start-all.sh


I know there are JobTrackerInstrumentation, JobTracker, Namenode etc so what
is the order of their initialization?
and after bin/start-all.sh and before i run any job, what exits in the
system?

run jps -v to see what's up and about, netstat -p to list ports in useby the different processes.

The nodes are all designed to spin a bit waiting for their dependenciesto come up; you don't need to bring them up in a strict order (whichwould be namenode-datanode(s)-jobtracker-tasktracker(s)) for a full MRcluster.

I have tests that poll for the various ports to be open beforesubmitting work, and they sometimes get unhappy if you try submittingjobs straight after the job tracker appears live. If you are going tospin waiting for a job tracker to be visible, I would sleep a fewseconds after it's IPC port opens up before sending in work. This isclearly some race condition, but not anything I've sat down to look at,as it's only a startup and a 10s sleep makes it go away

Re: what is the relation between the classes at the very beginning?

Reply via email to