Re: CombineFileInputFormat not producing multiple mappers

Keith Wiley Thu, 29 Apr 2010 23:57:48 -0700

Yep, that was part of it. Thank you. Also, I was not settingsplittable true for the Combined Input because I knew the containedfiles themselves were no splittable. Setting the Combined Input'ssplittable to true appears to have been important as well.


Thank you.


On 2010, Apr 29, at 11:53 PM, Aleksandar Stupar wrote:

Hi,
if the mapred.max.split.size is not set (and it's not by default)than CombineFileInputFormatonly takes racks in account when grouping blocks. So if you set thisproperty it will take alsoblock placement on machines into account and you should get multiplemappers.
Hope this helps,
Aleksandar Stupar.




________________________________
From: Keith Wiley <kwi...@keithwiley.com>
To: common-user@hadoop.apache.org
Sent: Thu, April 29, 2010 11:23:35 PM
Subject: CombineFileInputFormat not producing multiple mappers
I am using CombineFileInputFormat and CombineFileSplit to groupsmall input files as fed to the mappers. The job runs properly andthe output is correct, but I get only one mapper task, so I lose allmy paralleization in the map stage.
I realize I'm not providing much detail yet because I'm not surewhat to say. Feel free to ask questions for clarification.
What might cause this problem and how might I diagnose -- must lessfix -- it?
Thank you.

________________________________________________________________________________
Keith Wiley              kwi...@keithwiley.com              www.keithwiley.com
"And what if we picked the wrong religion? Every week, we're justmaking God
madder and madder!"
 -- Homer Simpson
________________________________________________________________________________



________________________________________________________________________________

Keith Wiley kwi...@keithwiley.com keithwiley.commusic.keithwiley.com


"Luminous beings are we, not this crude matter."
                                           --  Yoda
________________________________________________________________________________

Re: CombineFileInputFormat not producing multiple mappers

Reply via email to