Re: How should we handle queue configuration in hadoop?

arkady borkovsky Mon, 02 Jun 2008 23:35:54 -0700

HADOOP-3421 seems to combine 2 sorts of requirements:
(a) requirements for a resource manager for Hadoop (map reduce) jobs

(b) requirements for the operations that can be performed on Hadoop (map reduce) jobs, which are necessary to implement (a) and a certain class of other "resource managers" with potentially different requirements.

One can imagine a resource manager being run on a single machine outside a Hadoop grid. It would have a local database of jobs, queues, etc. and their status. And it would talk to Hadoop to get the current state of the running jobs, and to request to do something with specific jobs -- kill, put to sleep, change the resource quota, etc. Its counterpart on the Hadoop grid would not need to know all the concepts that the "resource manager" operates with.


Such a layered design may make it easier to answer some of the questions.

E.g. it kind of implies that (b) is part of Hadoop and belongs in JIRA, while (a) may be completely specific to different organizations.
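
For illustration only, the layering described above could be sketched roughly like this in Python. Nothing here comes from HADOOP-3421: the names JobControl, FakeGrid, ResourceManager and enforce are hypothetical, and the only policy shown (a per-queue limit on running jobs) is an assumption picked to keep the example short. The point is that the grid side exposes job operations only, while queues live entirely in the outside manager.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Protocol


class JobControl(Protocol):
    """Layer (b): the narrow set of job operations the grid exposes (hypothetical)."""

    def running_jobs(self) -> List[str]: ...

    def kill(self, job_id: str) -> None: ...


class FakeGrid:
    """Stand-in for the Hadoop side. It knows about jobs, not about queues."""

    def __init__(self, jobs: List[str]) -> None:
        self._jobs = list(jobs)

    def running_jobs(self) -> List[str]:
        return list(self._jobs)  # copy, so callers can kill while iterating

    def kill(self, job_id: str) -> None:
        self._jobs.remove(job_id)


@dataclass
class ResourceManager:
    """Layer (a): site-specific policy kept outside the grid, with its own records."""

    grid: JobControl
    max_running: Dict[str, int]                              # queue name -> limit
    queue_of: Dict[str, str] = field(default_factory=dict)   # job id -> queue name

    def enforce(self) -> List[str]:
        """Kill jobs in any queue that is running more jobs than its limit."""
        killed: List[str] = []
        seen: Dict[str, int] = {}
        for job_id in self.grid.running_jobs():
            queue = self.queue_of[job_id]
            seen[queue] = seen.get(queue, 0) + 1
            if seen[queue] > self.max_running[queue]:
                self.grid.kill(job_id)
                killed.append(job_id)
        return killed
```

A different organization could swap in its own ResourceManager with different concepts, as long as it only relies on the JobControl operations.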


2c


On Jun 2, 2008, at 10:39 PM, Vivek Ratan wrote:

I'd like to get some feedback on how to implement configuration for
queues.

Quick background: As part of the new Resource Manager in Hadoop
(HADOOP-3421), a single Hadoop installation supports one or more queues
that users submit jobs to. Eventually (it is hoped), an installation will
support one or more Orgs, each with one or more queues.

The problem: Queues have attributes: a name, whether it supports
priorities, a 'guaranteed capacity', a list of allowed users, a list of
rejected users, and so on. How do we handle this configuration? Some
constraints:
* We'd like the default installation to have a single queue with default
values, so the system works out of the box, but an admin can configure
multiple queues, each with its own config values. Different
installations can have different numbers of queues.
* Orgs, queues, and users provide a hierarchy. You could set default
values for some config variables in the Org, and individual queues could
override them. Similarly, individual users could override Org/queue
defaults. This is more of a long term goal. For V1, we can get away with
queue-specific configuration only.
* Some config values can be changed by the admin while the system is
running, and these need to be re-read by Hadoop within a reasonable
amount of time. For example, an admin may dynamically change the
guaranteed capacity of queues (if new machines are added to a cluster,
for example). You don't want to restart the JT to read new values.
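
To make the last constraint concrete, here is a minimal Python sketch of re-reading a file only when it has changed on disk. It is an illustration under stated assumptions, not how the JT does or should do it: the class ReloadableQueueConfig, the helper parse_capacities, and the one-line-per-queue "name=capacity" file format are all hypothetical.

```python
import os
from typing import Callable, Dict, Optional


def parse_capacities(text: str) -> Dict[str, int]:
    """Parse hypothetical 'queue=capacity' lines; blank lines and # comments are skipped."""
    result: Dict[str, int] = {}
    for line in text.splitlines():
        line = line.strip()
        if line and not line.startswith("#"):
            name, _, value = line.partition("=")
            result[name.strip()] = int(value)
    return result


class ReloadableQueueConfig:
    """Re-parses the file when its modification time differs from the last one seen."""

    def __init__(self, path: str, parse: Callable[[str], Dict[str, int]]) -> None:
        self._path = path
        self._parse = parse
        self._mtime: Optional[int] = None
        self._values: Dict[str, int] = {}

    def get(self) -> Dict[str, int]:
        mtime = os.stat(self._path).st_mtime_ns
        if mtime != self._mtime:
            with open(self._path) as f:
                self._values = self._parse(f.read())
            self._mtime = mtime
        return self._values
```

A long-running process could call get() from a periodic timer, so a change to guaranteed capacities is picked up within one polling interval and no restart is needed.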

What are the implementation choices we're facing?

- Where do we specify config values? It seems clear that
hadoop-default.xml should contain configuration for a single queue with
appropriate default values. If an admin wants to set up multiple queues,
this information can go in a separate file. hadoop-site.xml? Or maybe a
separate config file for queues? While having multiple config files
leads to problems in managing them, there are some supporting arguments
for a separate config file for queues: this file can be re-read
periodically (which avoids having to re-read hadoop-site.xml), and this
file is only read by the JobTracker, so there's no issue of overriding
values elsewhere.
- How do we specify config values? You can have more than one queue, and
each queue has its attributes.
* You could have properties like "hadoop.scheduler.queue1.name",
"hadoop.scheduler.queue1.guaranteed_capacity",
"hadoop.scheduler.queue2.name"... Bit of a pain to write, and since the
number of queues is not known statically, how do you know how many of
these properties to read? You could have a separate property that tells
you how many queues there are, and then the JT can build the property
names dynamically. (HADOOP-3407 would also help here).
* You may want comma separated values. So "hadoop.scheduler.queue.names"
would have comma separated values for all queue names,
"hadoop.scheduler.queue.guaranteed_capacity" would have comma separated
values for capacities for each queue. This can get very difficult to
maintain as you have to make sure the attribute values for each queue
show up in the right place among the comma separated values.
Do we have any examples of config settings where you can have a dynamic
number of top-level entries, and each entry has multiple attributes?
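
As a rough comparison of the two naming schemes above, here is a small Python sketch that reads both from a plain dictionary of properties. It is only an illustration: the property "hadoop.scheduler.queue.count" is a hypothetical name for the "separate property that tells you how many queues there are", and the functions read_numbered and read_comma_separated are made up for this example.

```python
from typing import Dict, List

QUEUE_ATTRS = ("name", "guaranteed_capacity")


def read_numbered(props: Dict[str, str]) -> List[Dict[str, str]]:
    """Scheme 1: a count property, plus per-queue property names built dynamically."""
    count = int(props.get("hadoop.scheduler.queue.count", "0"))  # hypothetical property
    queues = []
    for i in range(1, count + 1):
        prefix = f"hadoop.scheduler.queue{i}."
        queues.append({attr: props[prefix + attr] for attr in QUEUE_ATTRS})
    return queues


def _split(value: str) -> List[str]:
    return [item.strip() for item in value.split(",")]


def read_comma_separated(props: Dict[str, str]) -> List[Dict[str, str]]:
    """Scheme 2: one comma separated list per attribute, matched up by position."""
    names = _split(props["hadoop.scheduler.queue.names"])
    capacities = _split(props["hadoop.scheduler.queue.guaranteed_capacity"])
    if len(names) != len(capacities):
        # The maintenance hazard: the lists can silently drift out of step.
        raise ValueError("queue names and capacities have different lengths")
    return [
        {"name": n, "guaranteed_capacity": c} for n, c in zip(names, capacities)
    ]
```

Note that the length check in the second reader only catches a missing or extra value. Two values swapped within a list would still parse, which is the maintenance problem described above.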

Should this discussion be on a Jira? I hesitated, as there seems to be
more than one issue to resolve here.
