houman babai created OOZIE-3604:
-----------------------------------

             Summary: Oozie Launcher & Map-Reduce Action Complete Successfully 
However Oozie Still Fails the Action
                 Key: OOZIE-3604
                 URL: https://issues.apache.org/jira/browse/OOZIE-3604
             Project: Oozie
          Issue Type: Bug
          Components: core
    Affects Versions: 5.1.0
         Environment: 5.1.0-cdh-6.3.1
5.1.0-cdh-6.3.2
            Reporter: houman babai


I am using Oozie 5.1.0-cdh6.3.1.

My workflow has a MapReduce action that generates over 300 counters. 
The Oozie launcher and the MapReduce job both complete successfully; however, 
Oozie reports:
{code:java}
Error Code: LimitExceededException

LimitExceededException: Too many counters: 121 max=120
{code}
I have updated mapred-site.xml to raise the counter limit.

The log for the *mapreduce job* reports success; in fact, I can see all the 
counters and the actual output of the MapReduce job on HDFS.

In the Oozie *launcher log* I can see:
 * mapreduce.job.counters.max : 8192
 * mapreduce.job.counters.groups.max : 100

I set the Oozie logs to TRACE, and I can see the following WARN entries:
{code:java}
WARN org.apache.oozie.command.wf.ActionEndXCommand: SERVER[SERVER-NAME] 
USER[MY-NAME] GROUP[-] TOKEN[] APP[APP-NAME] 
JOB[0000012-200714223028181-oozie-oozi-W] 
ACTION[0000012-200714223028181-oozie-oozi-W@ACTION-NAME] Error ending action 
[ACTION-NAME]. ErrorType [ERROR], ErrorCode [LimitExceededException], Message 
[LimitExceededException: Too many counters: 121 max=120]

WARN org.apache.oozie.command.wf.ActionEndXCommand: SERVER[SERVER-NAME] 
USER[MY-NAME] GROUP[-] TOKEN[] APP[APP-NAME] 
JOB[0000012-200714223028181-oozie-oozi-W] 
ACTION[0000012-200714223028181-oozie-oozi-W@ACTION-NAME] Setting Action Status 
to [ERROR]
{code}
Looking through the Oozie source code, I think the issue may be traced to 
[this line in MapReduceActionExecutor|https://github.com/apache/oozie/blob/a40ab5361372aa73c9e4aa386a9c81bd21742aa4/core/src/main/java/org/apache/oozie/action/hadoop/MapReduceActionExecutor.java#L250].

I am guessing that, for some reason, the context or the config passed into 
[createJobClient|https://github.com/apache/oozie/blob/a40ab5361372aa73c9e4aa386a9c81bd21742aa4/core/src/main/java/org/apache/oozie/action/hadoop/MapReduceActionExecutor.java#L242]
 lacks the mapreduce.job.counters.max configuration.
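
To illustrate the guess above, here is a small self-contained Java sketch. It is 
not Oozie or Hadoop code: the names CounterLimitSketch, CounterLimit and 
loadCounters are hypothetical, and the fallback of 120 is assumed from the 
"max=120" in the error message. It only shows how a limit check that reads 
mapreduce.job.counters.max from whatever configuration it is handed would fail 
on the 121st counter when that key is missing, even though another 
configuration sets it to 8192.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; none of these classes exist in Oozie or Hadoop.
final class CounterLimitSketch {
    static final String COUNTERS_MAX_KEY = "mapreduce.job.counters.max";
    // Assumed fallback, taken from "max=120" in the reported error message.
    static final int ASSUMED_DEFAULT_MAX = 120;

    /** Stand-in for the exception named in the Oozie log. */
    static final class LimitExceededException extends RuntimeException {
        LimitExceededException(String message) {
            super(message);
        }
    }

    /** Tracks how many counters were added against a limit read from the config. */
    static final class CounterLimit {
        private final int max;
        private int total;

        CounterLimit(Map<String, String> conf) {
            String value = conf.get(COUNTERS_MAX_KEY);
            // A config without the key silently falls back to the default.
            this.max = (value == null) ? ASSUMED_DEFAULT_MAX : Integer.parseInt(value);
        }

        void addCounter() {
            int next = total + 1;
            if (next > max) {
                throw new LimitExceededException("Too many counters: " + next + " max=" + max);
            }
            total = next;
        }
    }

    /** Adds counterCount counters under the given config; returns "OK" or the error text. */
    static String loadCounters(Map<String, String> conf, int counterCount) {
        CounterLimit limit = new CounterLimit(conf);
        try {
            for (int i = 0; i < counterCount; i++) {
                limit.addCounter();
            }
            return "OK";
        } catch (LimitExceededException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        // Config as seen in the launcher log: the limit is raised.
        Map<String, String> launcherConf = new HashMap<>();
        launcherConf.put(COUNTERS_MAX_KEY, "8192");

        // Config as it might reach the server-side client: the key is absent.
        Map<String, String> serverConf = new HashMap<>();

        System.out.println(loadCounters(launcherConf, 300)); // prints "OK"
        System.out.println(loadCounters(serverConf, 300));   // prints "Too many counters: 121 max=120"
    }
}
```

If the server-side client really is built from a configuration without the key, 
this would match the symptom: the job itself succeeds, and the failure only 
appears when Oozie reads the counters back at the end of the action.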

I also tried several other things, hoping one of them would work:
 * I added mapreduce.job.counters.max to the action's configuration 
(inside the workflow)
 * I added mapreduce.job.counters.max to action config default.xml

Nothing helped.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
