[jira] [Created] (OOZIE-3578) MapReduce counters cannot be used over 120
Denes Bodo created OOZIE-3578:
---------------------------------

             Summary: MapReduce counters cannot be used over 120
                 Key: OOZIE-3578
                 URL: https://issues.apache.org/jira/browse/OOZIE-3578
             Project: Oozie
          Issue Type: Bug
          Components: core
    Affects Versions: 5.1.0
            Reporter: Denes Bodo


When we create a MapReduce action that produces more than 120 counters, the following exception is thrown:
{noformat}
org.apache.hadoop.mapreduce.counters.Limits.checkCounters(Limits.java:101)
org.apache.hadoop.mapreduce.counters.Limits.incrCounters(Limits.java:108)
org.apache.hadoop.mapreduce.counters.AbstractCounterGroup.addCounter(AbstractCounterGroup.java:78)
org.apache.hadoop.mapreduce.counters.AbstractCounterGroup.addCounterImpl(AbstractCounterGroup.java:95)
org.apache.hadoop.mapreduce.counters.AbstractCounterGroup.findCounterImpl(AbstractCounterGroup.java:123)
org.apache.hadoop.mapreduce.counters.AbstractCounterGroup.findCounter(AbstractCounterGroup.java:113)
org.apache.hadoop.mapreduce.counters.AbstractCounterGroup.findCounter(AbstractCounterGroup.java:130)
org.apache.hadoop.mapreduce.counters.AbstractCounters.findCounter(AbstractCounters.java:155)
org.apache.hadoop.mapreduce.TypeConverter.fromYarn(TypeConverter.java:264)
org.apache.hadoop.mapred.ClientServiceDelegate.getJobCounters(ClientServiceDelegate.java:383)
org.apache.hadoop.mapred.YARNRunner.getJobCounters(YARNRunner.java:859)
org.apache.hadoop.mapreduce.Job$8.run(Job.java:820)
org.apache.hadoop.mapreduce.Job$8.run(Job.java:817)
java.security.AccessController.doPrivileged(Native Method)
javax.security.auth.Subject.doAs(Subject.java:422)
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1875)
org.apache.hadoop.mapreduce.Job.getCounters(Job.java:817)
org.apache.hadoop.mapred.JobClient$NetworkedJob.getCounters(JobClient.java:379)
org.apache.oozie.action.hadoop.MapReduceActionExecutor.end(MapReduceActionExecutor.java:252)
org.apache.oozie.command.wf.ActionEndXCommand.execute(ActionEndXCommand.java:183)
org.apache.oozie.command.wf.ActionEndXCommand.execute(ActionEndXCommand.java:62)
org.apache.oozie.command.XCommand.call(XCommand.java:291)
org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:244)
org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:56)
org.apache.oozie.command.XCommand.call(XCommand.java:291)
java.util.concurrent.FutureTask.run(FutureTask.java:266)
org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:210)
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
java.lang.Thread.run(Thread.java:748)
{noformat}
It turned out that when Oozie is used with Hadoop 3, the MR class {{Limits}} is not initialised from the cluster configuration and falls back to its default values:
https://github.com/apache/hadoop/blob/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/counters/Limits.java#L40

Setting "mapreduce.job.counters.max" to 500 in mapred-site.xml or in core-site.xml has no effect.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
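The symptom described above (a configured limit of 500 being ignored in favour of the default of 120) can be illustrated with a small, self-contained toy model. This is only a sketch under stated assumptions: the class CounterLimitsSketch, its methods, and the "first init call wins" guard are hypothetical and are not Hadoop's {{Limits}} implementation or Oozie code. Only the property name and the value 120 come from the report. It shows one way a statically held limit can stay at its default when it is first initialised without the site configuration; it does not claim to be the root cause of OOZIE-3578.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Toy model of a statically held counter limit. Hypothetical names throughout;
 * this is NOT org.apache.hadoop.mapreduce.counters.Limits.
 */
class CounterLimitsSketch {
    // Property name quoted in the report; the default of 120 is the limit the report hits.
    static final String COUNTERS_MAX_KEY = "mapreduce.job.counters.max";
    static final int COUNTERS_MAX_DEFAULT = 120;

    private static boolean initialised = false;
    private static int countersMax = COUNTERS_MAX_DEFAULT;

    /** Assumed behaviour: the first call fixes the limit, later calls are ignored. */
    static synchronized void init(Map<String, String> conf) {
        if (initialised) {
            return;
        }
        String value = conf.get(COUNTERS_MAX_KEY);
        countersMax = (value == null) ? COUNTERS_MAX_DEFAULT : Integer.parseInt(value);
        initialised = true;
    }

    /** Clears the static state so the limit can be initialised again. */
    static synchronized void reset() {
        initialised = false;
        countersMax = COUNTERS_MAX_DEFAULT;
    }

    /** Throws once the number of counters exceeds the active limit. */
    static synchronized void checkCounters(int size) {
        if (size > countersMax) {
            throw new IllegalStateException("Too many counters: " + size + " max=" + countersMax);
        }
    }

    /** Returns true if n counters can be registered one by one without hitting the limit. */
    static boolean tryRegister(int n) {
        try {
            for (int i = 1; i <= n; i++) {
                checkCounters(i);
            }
            return true;
        } catch (IllegalStateException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        Map<String, String> site = new HashMap<>();
        site.put(COUNTERS_MAX_KEY, "500");

        init(new HashMap<>());                  // initialised without the site setting
        System.out.println(tryRegister(120));   // true: still within the default
        System.out.println(tryRegister(121));   // false: the 121st counter is rejected

        init(site);                             // ignored, the limit is already fixed
        System.out.println(tryRegister(121));   // false: 500 never took effect

        reset();
        init(site);                             // initialised from the site setting
        System.out.println(tryRegister(121));   // true
    }
}
```

In this model the configured value only takes effect if it is present in the configuration passed to the first init call, or if the static state is reset before re-initialising.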
[jira] Subscription: Oozie Patch Available
Issue Subscription
Filter: Oozie Patch Available (93 issues)

Subscriber: ooziedaily

Key         Summary
OOZIE-3574  JavaAction create incorrect fileSystem instance in addActionLibs method
            https://issues.apache.org/jira/browse/OOZIE-3574
OOZIE-3569  SSH Action should add checking success file
            https://issues.apache.org/jira/browse/OOZIE-3569
OOZIE-3568  Have large amount of log information “WARN messages [main] openjpa.MetaData” in jetty.log need to clean
            https://issues.apache.org/jira/browse/OOZIE-3568
OOZIE-3567  Oozie ShellAction should support absolute bash file path
            https://issues.apache.org/jira/browse/OOZIE-3567
OOZIE-3560  IDEA shows have some error in index.jsp
            https://issues.apache.org/jira/browse/OOZIE-3560
OOZIE-3482  Fix bug in CoordSubmitXCommand#validateCoordinatorJob
            https://issues.apache.org/jira/browse/OOZIE-3482
OOZIE-3480  Add windowactionstatus metrics in DBLiteWorkflowStoreService
            https://issues.apache.org/jira/browse/OOZIE-3480
OOZIE-3461  CoordMaterializeTriggerService code cleanup
            https://issues.apache.org/jira/browse/OOZIE-3461
OOZIE-3449  Make spark-2 as the default profile
            https://issues.apache.org/jira/browse/OOZIE-3449
OOZIE-3447  Run test case in local : It shows oozie-hsqldb-orm.xml exception
            https://issues.apache.org/jira/browse/OOZIE-3447
OOZIE-3418  Upgrade to Guava 27
            https://issues.apache.org/jira/browse/OOZIE-3418
OOZIE-3404  The env variable of SPARK_HOME needs to be set when running pySpark
            https://issues.apache.org/jira/browse/OOZIE-3404
OOZIE-3375  Can't use empty in coordinator
            https://issues.apache.org/jira/browse/OOZIE-3375
OOZIE-3367  Using && in EL expressions in oozie bundle.xml files generates parse errors
            https://issues.apache.org/jira/browse/OOZIE-3367
OOZIE-3366  Update workflow status and subworkflow status on suspend command
            https://issues.apache.org/jira/browse/OOZIE-3366
OOZIE-3364  Rerunning Oozie bundle jobs starts the coordinators in indeterminate order
            https://issues.apache.org/jira/browse/OOZIE-3364
OOZIE-3362  When killed, SSH action should kill the spawned processes on target host
            https://issues.apache.org/jira/browse/OOZIE-3362
OOZIE-3335  Cleanup parseFilter methods
            https://issues.apache.org/jira/browse/OOZIE-3335
OOZIE-3328  Create Hive compatibility action executor to run hive actions using beeline
            https://issues.apache.org/jira/browse/OOZIE-3328
OOZIE-3319  Log SSH action callback error output
            https://issues.apache.org/jira/browse/OOZIE-3319
OOZIE-3301  Update NOTICE file
            https://issues.apache.org/jira/browse/OOZIE-3301
OOZIE-3274  Remove slf4j
            https://issues.apache.org/jira/browse/OOZIE-3274
OOZIE-3266  Coord action rerun support RERUN_SKIP_NODES option
            https://issues.apache.org/jira/browse/OOZIE-3266
OOZIE-3256  refactor OozieCLI class
            https://issues.apache.org/jira/browse/OOZIE-3256
OOZIE-3254  [coordinator] LAST_ONLY and NONE execution modes: possible OutOfMemoryError when there are too many coordinator actions to materialize
            https://issues.apache.org/jira/browse/OOZIE-3254
OOZIE-3199  Let system property restriction configurable
            https://issues.apache.org/jira/browse/OOZIE-3199
OOZIE-3196  Authorization: restrict world readability by user
            https://issues.apache.org/jira/browse/OOZIE-3196
OOZIE-3170  Oozie Diagnostic Bundle tool fails with NPE due to missing service class
            https://issues.apache.org/jira/browse/OOZIE-3170
OOZIE-3137  Add support for log4j2 in HiveMain
            https://issues.apache.org/jira/browse/OOZIE-3137
OOZIE-3135  Configure log4j2 in SqoopMain
            https://issues.apache.org/jira/browse/OOZIE-3135
OOZIE-3091  Oozie Sqoop Avro Import fails with "java.lang.NoClassDefFoundError: org/apache/avro/mapred/AvroWrapper"
            https://issues.apache.org/jira/browse/OOZIE-3091
OOZIE-3071  Oozie 4.3 Spark sharelib ueses a different version of commons-lang3 than Spark 2.2.0
            https://issues.apache.org/jira/browse/OOZIE-3071
OOZIE-3063  Sanitizing variables that are part of openjpa.ConnectionProperties
            https://issues.apache.org/jira/browse/OOZIE-3063
OOZIE-3062  Set HADOOP_CONF_DIR for spark action
            https://issues.apache.org/jira/browse/OOZIE-3062
OOZIE-2952  Fix Findbugs warnings in oozie-sharelib-oozie
            https://issues.apache.org/jira/browse/OOZIE-2952
OOZIE-2834  ParameterVerifier logging non-useful warning for workflow definition
            https://issues.apache.org/jira/browse/OOZIE-2834
OOZIE-2812  SparkConfigurationService should support loading configurations from multiple Spark versions
            https://issues.apache.org/jira/browse/OOZIE-2812
OOZIE-2795  Create lib directory or symlink for Oozie CLI during packaging
[jira] [Assigned] (OOZIE-3066) Possibility to set retry-max globally
     [ https://issues.apache.org/jira/browse/OOZIE-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Qs Bao reassigned OOZIE-3066:
-----------------------------

    Assignee: Qs Bao

> Possibility to set retry-max globally
> -------------------------------------
>
>                 Key: OOZIE-3066
>                 URL: https://issues.apache.org/jira/browse/OOZIE-3066
>             Project: Oozie
>          Issue Type: New Feature
>            Reporter: Gézapeti
>            Assignee: Qs Bao
>            Priority: Major
>
> Currently there is no way to set the equivalent of {{retry-max}} globally so that it applies to every action.
> It should be possible to set a default value for it to make workflow execution more reliable.
[jira] [Commented] (OOZIE-3066) Possibility to set retry-max globally
    [ https://issues.apache.org/jira/browse/OOZIE-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17008611#comment-17008611 ]

Andras Salamon commented on OOZIE-3066:
---------------------------------------

Sounds like a good plan to me.

[~qsbao] I've added you to the contributor list; just assign this Jira to yourself when you start working on it.

> Possibility to set retry-max globally
> -------------------------------------
>
>                 Key: OOZIE-3066
>                 URL: https://issues.apache.org/jira/browse/OOZIE-3066
>             Project: Oozie
>          Issue Type: New Feature
>            Reporter: Gézapeti
>            Priority: Major
>
> Currently there is no way to set the equivalent of {{retry-max}} globally so that it applies to every action.
> It should be possible to set a default value for it to make workflow execution more reliable.