Hi,

I want to put Hadoop into my webapp and have it start up when the servlet starts up. Based on the shell scripts, I think I would need to do the following:

1. Package the appropriate libraries and config files (hadoop-site, slaves, etc.) into the webapp

2. (based on start-all.sh)
a. Start the namenode: org.apache.hadoop.dfs.NameNode (need to look into what is in the main() method to make sure I construct/ invoke this correctly)
     b. Start the datanode:  org.apache.hadoop.dfs.DataNode
     c. Start the Jobtracker:  org.apache.hadoop.mapred.JobTracker
     d. Start the TaskTracker: org.apache.hadoop.mapred.TaskTracker

I am new to Hadoop, so is this reasonable?  What am I missing?

Also, is it possible to dynamically register slave nodes? I have been looking a little bit at zeroconf/bonjour network stuff and was wondering if it could be used to bring resources online automatically (would limit the nodes to a subnet, but that is fine for my needs).

Thanks,
Grant



--------------------------
Grant Ingersoll
Sr. Software Engineer
Center for Natural Language Processing
Syracuse University
335 Hinds Hall
Syracuse, NY 13244
http://www.cnlp.org



Reply via email to