Hey Keith, Sorry for the late response here (I had meant to reply but I believe I got distracted and forgot all about it):
I agree with you on all counts. The config is indeed for service-level slots. My reply was to only correct Kartheek's assumptions. Regarding documentation - I'd love to be able to correct them up myself, but lack the focussed time at the moment to do so right away. I am willing to review and commit it in for you though, if you're willing to contribute! Please just let me know the JIRA after you've filed one and I will track it. Note that if you use YARN to run the new MR2 code (MR API is the same, just the platform/submission-execution model has changed), the concept of hard slots have gone away and presently the slots are determined via the job's memory request (mapreduce.{map/reduce}.memory.mb) against a NodeManager's total offered memory for service. There is no longer a single hard config that controls max number of tasks that may run simultaneously per node (but can be achieved via some node manager/scheduler memory resource config hacks, ending up to be brittle though). CPU-specific requests are coming soon for YARN: https://issues.apache.org/jira/browse/MAPREDUCE-4327 On Wed, Jun 6, 2012 at 10:38 PM, Keith Wiley <kwi...@keithwiley.com> wrote: > > On Jun 6, 2012, at 03:42 , Harsh J wrote: > >>> I think mapred.tasktracker.map.tasks.maximum sets the number of map >> tasks and not slots. >> >> This is incorrect. The property does configure slots. Please also see >> http://wiki.apache.org/hadoop/HowManyMapsAndReduces and >> http://wiki.apache.org/hadoop/FAQ#I_see_a_maximum_of_2_maps.2BAC8-reduces_spawned_concurrently_on_each_TaskTracker.2C_how_do_I_increase_that.3F >> for more. > > > But Harsh, wouldn't you agree that the first reference you provided above is > talking about the number of tasks spawned for a given job at job-runtime and > not the number of slots hard-configured into the cluster at cluster-spinup > time? > > Incidentally, the second reference above is partially broken. It attempts to > offer links to dig into further detail about > mapred.tasktracker.map.tasks.maximum and > mapred.tasktracker.reduce.tasks.maximum, but the links are broken. For > example, one of the two broken links is: > > http://hadoop.apache.org/common/docs/current/hadoop-default.html#mapred.tasktracker.map.tasks.maximum > > It's still broken even if you remove the anchor from the end of the URL, > which is to say the hadoop-default.html webpage doesn't even exist. > > In fact, it is difficult find any official documentation on those properties > (Google searches for the terms do not provide links to any proper > documentation within apache, but rather just lots of back and forth forum > discussions about the properties). One thing I did find was a claim that > those properties are deprecated in 2.0.0: > > http://hadoop.apache.org/common/docs/current/hadoop-project-dist/hadoop-common/DeprecatedProperties.html > > That page indicates that they were replaced with equivalents in which the > first component is now 'mapreduce', not 'mapred'. Even with the new terms > however, Google still doesn't link to any formal documentation describing > those properties. In fact, I have yet to find a webpage anywhere which > officially states the purpose/effect of > mapred(uce).tasktracker.map.tasks.maximum. > > That said, I agree that the consensus of discussion and description seems to > imply that these properties have a cluster-level (not job-level) effect on > the number of map/reduce slots on the cluster, not the number of tasks > spawned for a given job. Such a concept obviously convolutes the intuition > that slots correspond to cores as I suggested in an earlier post and I > apologize for that. > > ________________________________________________________________________________ > Keith Wiley kwi...@keithwiley.com keithwiley.com > music.keithwiley.com > > "Yet mark his perfect self-contentment, and hence learn his lesson, that to be > self-contented is to be vile and ignorant, and that to aspire is better than > to > be blindly and impotently happy." > -- Edwin A. Abbott, Flatland > ________________________________________________________________________________ > -- Harsh J