Re: [jira] Updated: (HADOOP-1301) resource management proviosioning for Hadoop

Nigel Daley Fri, 04 Jan 2008 11:16:35 -0800

With this new contrib component, I have added contrib/hod as a Jiracomponent.


On Jan 4, 2008, at 10:23 AM, Nigel Daley (JIRA) wrote:

[ https://issues.apache.org/jira/browse/HADOOP-1301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nigel Daley updated HADOOP-1301:
--------------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)
resource management proviosioning for Hadoop
--------------------------------------------

                Key: HADOOP-1301
URL: https://issues.apache.org/jira/browse/HADOOP-1301
            Project: Hadoop
         Issue Type: New Feature
         Components: mapred
   Affects Versions: 0.16.0
           Reporter: Pete Wyckoff
           Assignee: Hemanth Yamijala
            Fix For: 0.16.0
Attachments: hod-hadoop.patch, hod-hadoop.v2.patch, hod-hadoop.v3.patch, hod-hadoop.v4.patch, hod-open-4.tar.gz, hod.0.2.2.tar.gz
The Hadoop On Demand (HOD) project addresses the provisioning andmanaging of MapReduce instances on cluster resources. With HOD,the MapReduce user interacts with the cluster solely through aself-service interface and the JT, TT info ports. The user neverneeds to log into the cluster or even have an account on thecluster for that matter. HOD allocates nodes, provisions MapReduce(and optionally HDFS) on the cluster and when the user is donewith MapReduce jobs, cleanly shuts down MapReduce and de-allocatesthe nodes (i.e., re-introducing them to the pool of availableresources in the cluster).Using HOD, a cluster can be shared among different users in a fairand efficient manner. HOD is not a replacement or re-implementation of a traditional resource manager. HOD isimplemented using the resource manager paradigm and at present isenvisioned supporting Torque and Condor out of the box. It alsosupports "static" resources, i.e., a dedicated set of resourcesnot using a resource manager.HOD is also self provisioning and, thus, can be used on systemssuch as EC2 or a campus cluster not already running MapReducesoftware or a resouce manager. Figure 1 depicts a cluster usingHOD. As the figure shows, the user never logs into the clusteritself. The user's jobs run as the 'hod' user (a configurable unixid).The user interacts with MapReduce and the cluster using the hodshell, hodsh. Once in the hodsh, the user can allocate/de-allocatenodes and automatically run JT, TTs, NN, DNs on those nodeswithout knowing the specifics of which nodes are running which orlogging into any of those boxes. HOD transparently masks failuresby allocating nodes to replace failed nodes. Once the user hasallocated nodes, she can run /bin/MapReduce my1.jar and then /bin/MapReduce my2.jar ... from within the hod shell whichautomatically generates the configuration file for the MapReducescript. When done, the user will exit the shell.The hod shell has an automatic timeout so that users cannot hogresources they aren't using. The timeout applies only when thereis no MapReduce job running. In addition, hod also has the optionof tracking and enforcing user/group resource limits.Optionally, HOD can run dedicated log and directory services inthe cluster. The log services are a central repository forcollecting and retrieving Hadoop logs for any given job. Thedirectory service provides an easy way to inspect what's runningin the cluster or for the end user and html interfacing forgetting to their JT and TT info ports.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Re: [jira] Updated: (HADOOP-1301) resource management proviosioning for Hadoop

Reply via email to