What version of Pig are you using? Starting in 0.8, Pig will combine
small blocks into a single map. This prevents jobs that are actually
reading small amounts of data from taking up a lot of slots on the
cluster. You can turn this off by adding
-Dpig.noSplitCombination=true to your command line.
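For example, something like this (myscript.pig here is just a
placeholder for your own script):

    # disable Pig's split combining for this run
    pig -Dpig.noSplitCombination=true myscript.pig

Setting the same property in your pig.properties file should also
work if you want it off for every run.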
Alan.
On Mar 23, 2011, at 5:45 PM, Dexin Wang wrote:
And the nodes are pretty lightly loaded (~1.0) and there's plenty of
free memory. Now I'm seeing 2 mappers per node. Very much
under-utilized.
On Wed, Mar 23, 2011 at 1:39 PM, Dexin Wang <[email protected]> wrote:
Hi,
We've seen a strange problem where some Pig jobs run fewer mappers
concurrently than the cluster's mapper capacity. Specifically, we
have a 10-node cluster and each node is configured for 12 mappers,
so normally we have 120 mappers running. But some Pig jobs will only
have 10 mappers running (while nothing else is running on the
cluster), which appears to be 1 mapper per node.

We have not noticed the same problem with other, non-Pig Hadoop
jobs. Has anyone experienced the same thing, and do you have any
explanation or remedy?
Thanks!
Dexin