What Robert said. This can happen especially if you use Cloudera Manager
in your environment and the CM -> MapReduce configs have "Client
Overrides" set for things like task heap sizes. These administrative
enforcement settings can unfortunately cause any overrides of
mapred.child.java.opts (and others) passed by clients to be ignored
by the TaskTrackers.
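
For illustration, a cluster-side entry of the kind Robert describes might
look like the sketch below. The file it lives in (mapred-site.xml is only
a guess) and the heap value are assumptions, not taken from your cluster.
The part to look for is the <final> element, which makes Hadoop ignore
values for that property from configuration loaded later, such as the
options a client or Oozie submits with the job.

```xml
<!-- Hypothetical example: the -Xmx value is a placeholder -->
<property>
  <name>mapred.child.java.opts</name>
  <value>-Xmx1g</value>
  <!-- Marks the property as final so later overrides are dropped -->
  <final>true</final>
</property>
```

If a search of the configuration files on the worker hosts of the third
environment turns up an entry like this, and the other two environments
do not have it, that would be consistent with the behaviour you describe.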

On Fri, Dec 20, 2013 at 6:28 AM, Robert Kanter <[email protected]> wrote:
> I remember seeing something similar with java-opts where they were being
> set somewhere else (mapred-site?) and declared as final, so when Oozie
> tried to set them, it was ignored.  Perhaps you’re running into the same
> problem?
>
>
> On Thu, Dec 19, 2013 at 12:23 AM, Jose Fernandez <[email protected]> wrote:
>
>> Hi folks,
>>
>> I searched through the archives and googled my brains out and I'm flat out
>> stuck. I'm hoping someone can help me here :)  We're getting pretty
>> desperate.
>>
>> CDH 4.1.3
>> Oozie v3.2.0
>> Using MR2 (YARN)
>>
>> We have a Java action configured in workflow.xml thusly:
>>
>>     <action name="convert-file">
>>         <java>
>>             <job-tracker>${jobTracker}</job-tracker>
>>             <name-node>${nameNode}</name-node>
>>             <configuration>
>>                 <property>
>>                     <name>mapred.job.queue.name</name>
>>                     <value>${queueName}</value>
>>                 </property>
>>             </configuration>
>>             <main-class>com.sdl.hlab.infrastructure.SM2ApiResultsFileToHdfsConverter</main-class>
>>             <java-opts>-Djava.library.path=${sapNativePath}
>> -Dcom.sdl.sap.conf.directory="." -Xmx1900m
>> -Dlogback.configurationFile="conf/${SAP_ENV}/log4j.xml"
>> -Dlog4j.configuration="conf/${SAP_ENV}/log4j.xml"</java-opts>
>>             <arg>${filePath}</arg>
>>             <file>files/sap.properties</file>
>>         </java>
>>         <ok to="cleanup"/>
>>         <error to="convert-file-Email"/>
>>     </action>
>>
>> This action works perfectly on two different environments (all same CDH
>> and Oozie version), but not on our new third environment. In the first two
>> environments, the java-opts block is respected and I can see it in the Job
>> History logs in stdout under "Java System Properties". In the third
>> environment (which is new) none of the java-opts appear except for
>> java.library.path, which points to the wrong libraries.
>>
>> I checked everywhere for entries for mapred.child.java.opts and there
>> appears to be one entry buried in mapred-defaults.xml and that's it.
>>
>> I also tried to move the java opts to a property entry, no luck.
>>
>> The job log doesn't have anything interesting:
>>
>> 2013-12-19 00:41:05,419  INFO ActionStartXCommand:539 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@:start:] Start action
>> [0000012-131218165312793-oozie-oozi-W@:start:] with user-retry state :
>> userRetryCount [0], userRetryMax [0], userRetryInterval [10]
>> 2013-12-19 00:41:05,419  WARN ActionStartXCommand:542 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@:start:]
>> [***0000012-131218165312793-oozie-oozi-W@:start:***]Action status=DONE
>> 2013-12-19 00:41:05,419  WARN ActionStartXCommand:542 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@:start:]
>> [***0000012-131218165312793-oozie-oozi-W@:start:***]Action updated in DB!
>> 2013-12-19 00:41:05,561  INFO ActionStartXCommand:539 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file] Start action
>> [0000012-131218165312793-oozie-oozi-W@convert-file] with user-retry state
>> : userRetryCount [0], userRetryMax [0], userRetryInterval [10]
>> 2013-12-19 00:41:07,226  WARN JavaActionExecutor:542 - USER[hdfs] GROUP[-]
>> TOKEN[] APP[hlab-oozie-workflow] JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file] credentials is
>> null for the action
>> 2013-12-19 00:41:09,812  INFO JavaActionExecutor:539 - USER[hdfs] GROUP[-]
>> TOKEN[] APP[hlab-oozie-workflow] JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file] checking
>> action, external ID [job_1387408789132_0015] status [RUNNING]
>> 2013-12-19 00:41:09,815  WARN ActionStartXCommand:542 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file]
>> [***0000012-131218165312793-oozie-oozi-W@convert-file***]Action
>> status=RUNNING
>> 2013-12-19 00:41:09,815  WARN ActionStartXCommand:542 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file]
>> [***0000012-131218165312793-oozie-oozi-W@convert-file***]Action updated
>> in DB!
>> 2013-12-19 00:41:28,470  INFO CallbackServlet:539 - USER[-] GROUP[-]
>> TOKEN[-] APP[-] JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file] callback for
>> action [0000012-131218165312793-oozie-oozi-W@convert-file]
>> 2013-12-19 00:41:28,755  INFO JavaActionExecutor:539 - USER[hdfs] GROUP[-]
>> TOKEN[] APP[hlab-oozie-workflow] JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file] action
>> completed, external ID [job_1387408789132_0015]
>> 2013-12-19 00:41:28,787  WARN JavaActionExecutor:542 - USER[hdfs] GROUP[-]
>> TOKEN[] APP[hlab-oozie-workflow] JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file] Launcher ERROR,
>> reason: Main class [our main class], main() threw exception,
>> conf/qa/our.properties (No such file or directory)
>> 2013-12-19 00:41:28,787  WARN JavaActionExecutor:542 - USER[hdfs] GROUP[-]
>> TOKEN[] APP[hlab-oozie-workflow] JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file] Launcher
>> exception: conf/qa/our.properties (No such file or directory)
>> java.io.FileNotFoundException: conf/qa/our.properties (No such file or
>> directory)
>>                 at java.io.FileInputStream.open(Native Method)
>>                 at java.io.FileInputStream.<init>(FileInputStream.java:120)
>>                 at java.io.FileInputStream.<init>(FileInputStream.java:79)
>>                 at
>> com.sdl.sap.util.BaseConfiguration.configurationFile(Configuration.scala:78)
>>                 at
>> com.sdl.sap.util.BaseConfiguration.loadProperties(Configuration.scala:88)
>>                 at
>> com.sdl.sap.util.BaseConfiguration.properties$lzycompute(Configuration.scala:38)
>>                 at
>> com.sdl.sap.util.BaseConfiguration.properties(Configuration.scala:38)
>>                 at
>> com.sdl.sap.util.BaseConfiguration.stringValue(Configuration.scala:49)
>>                 at
>> com.sdl.hlab.infrastructure.SM2ApiResultsFileToHdfsConverter$.buildFileInfo(SM2ApiResultsFileToHdfsConverter.scala:74)
>>                 at
>> com.sdl.hlab.infrastructure.SM2ApiResultsFileToHdfsConverter$.main(SM2ApiResultsFileToHdfsConverter.scala:68)
>>                 at
>> com.sdl.hlab.infrastructure.SM2ApiResultsFileToHdfsConverter.main(SM2ApiResultsFileToHdfsConverter.scala)
>>                 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native
>> Method)
>>                 at
>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>>                 at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>>                 at java.lang.reflect.Method.invoke(Method.java:597)
>>                 at
>> org.apache.oozie.action.hadoop.LauncherMapper.map(LauncherMapper.java:472)
>>                 at
>> org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54)
>>                 at
>> org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:399)
>>                 at org.apache.hadoop.mapred.MapTask.run(MapTask.java:334)
>>                 at
>> org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:152)
>>                 at java.security.AccessController.doPrivileged(Native
>> Method)
>>                 at javax.security.auth.Subject.doAs(Subject.java:396)
>>                 at
>> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1332)
>>                 at
>> org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:147)
>>
>> 2013-12-19 00:41:28,864  INFO ActionEndXCommand:539 - USER[hdfs] GROUP[-]
>> TOKEN[] APP[hlab-oozie-workflow] JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file] ERROR is
>> considered as FAILED for SLA
>> 2013-12-19 00:41:29,021  INFO ActionStartXCommand:539 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file-Email] Start
>> action [0000012-131218165312793-oozie-oozi-W@convert-file-Email] with
>> user-retry state : userRetryCount [0], userRetryMax [0], userRetryInterval
>> [10]
>> 2013-12-19 00:41:29,112  WARN ActionStartXCommand:542 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file-Email]
>> [***0000012-131218165312793-oozie-oozi-W@convert-file-Email***]Action
>> status=DONE
>> 2013-12-19 00:41:29,113  WARN ActionStartXCommand:542 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file-Email]
>> [***0000012-131218165312793-oozie-oozi-W@convert-file-Email***]Action
>> updated in DB!
>> 2013-12-19 00:41:29,259  INFO ActionStartXCommand:539 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file-fail] Start
>> action [0000012-131218165312793-oozie-oozi-W@convert-file-fail] with
>> user-retry state : userRetryCount [0], userRetryMax [0], userRetryInterval
>> [10]
>> 2013-12-19 00:41:29,260  WARN ActionStartXCommand:542 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file-fail]
>> [***0000012-131218165312793-oozie-oozi-W@convert-file-fail***]Action
>> status=DONE
>> 2013-12-19 00:41:29,260  WARN ActionStartXCommand:542 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W]
>> ACTION[0000012-131218165312793-oozie-oozi-W@convert-file-fail]
>> [***0000012-131218165312793-oozie-oozi-W@convert-file-fail***]Action
>> updated in DB!
>> 2013-12-19 00:41:29,440  WARN CoordActionUpdateXCommand:542 - USER[hdfs]
>> GROUP[-] TOKEN[] APP[hlab-oozie-workflow]
>> JOB[0000012-131218165312793-oozie-oozi-W] ACTION[-] E1100: Command
>> precondition does not hold before execution, [, coord action is null],
>> Error Code: E1100
>>
>> The FileNotFoundException is the result of the property
>> com.sdl.sap.conf.directory not being set.
>>
>> Any help would be really appreciated!
>>



-- 
Harsh J
