Maps are failing if combiner is enabled
---------------------------------------

                 Key: PIG-1803
                 URL: https://issues.apache.org/jira/browse/PIG-1803
             Project: Pig
          Issue Type: Bug
            Reporter: Alex Rovner
             Fix For: 0.7.0


We are constantly hitting the java heap space memory issue if the combiner is 
enabled on our jobs.

Configs:
pig.cachedbag.memusage=20
io.sort.mb=300
pig.exec.nocombiner=false
mapred.child.java.opts=-Xmx750m

Sample job:
{noformat} 
A = LOAD '$INPUT' USING 
com.contextweb.pig.CWHeaderLoader('$WORK_DIR/schema/rpt.xml');
AA = foreach A GENERATE checkPointStart, PublisherId, TagId,
ContextCategoryId,Impressions, Clicks, Actions;

DESCRIBE AA;

B = GROUP AA BY (checkPointStart, PublisherId, TagId,
ContextCategoryId);

result = FOREACH B GENERATE group, SUM(AA.Impressions) as Impressions, 
SUM(AA.Clicks) as Clicks, SUM(AA.Actions) as Actions;

DESCRIBE result;

STORE result INTO '$OUTPUT' USING com.contextweb.pig.CWHeaderStore();
{noformat} 

Mapper Error Log:
2011-01-12 18:43:22,084 FATAL org.apache.hadoop.mapred.Child: Error running 
child : java.lang.OutOfMemoryError: Java heap space
        at 
org.apache.hadoop.mapred.MapTask$MapOutputBuffer.<init>(MapTask.java:799)
        at 
org.apache.hadoop.mapred.MapTask$NewOutputCollector.<init>(MapTask.java:549)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:631)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:315)
        at org.apache.hadoop.mapred.Child$4.run(Child.java:217)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:396)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1063)
        at org.apache.hadoop.mapred.Child.main(Child.java:211)


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to