I have seen this cryptic error thrown in the context of having wrong JobTracker/ResourceManager port in the job config. Please check that one.
On 11/18/13 11:26 PM, "Giridhar Addepalli" <[email protected]> wrote: >Hi All, > >We are using 3.3.0 version of Oozie. >Trying to run mapr-reduce app from example/apps that gets shipped with >oozie. > >We are getting following error:: > >2013-11-18 23:20:41,156 INFO ActionStartXCommand:539 - USER[gaddepa] >GROUP[-] TOKEN[] APP[map-reduce-wf] >JOB[0000985-131031060610521-oozie-root-W] >ACTION[0000985-131031060610521-oozie-root-W@:start:] Start action >[0000985-131031060610521-oozie-root-W@:start:] with user-retry state : >userRetryCount [0], userRetryMax [0], userRetryInterval [10] >2013-11-18 23:20:41,157 WARN ActionStartXCommand:542 - USER[gaddepa] >GROUP[-] TOKEN[] APP[map-reduce-wf] >JOB[0000985-131031060610521-oozie-root-W] >ACTION[0000985-131031060610521-oozie-root-W@:start:] >[***0000985-131031060610521-oozie-root-W@:start:***]Action status=DONE >2013-11-18 23:20:41,157 WARN ActionStartXCommand:542 - USER[gaddepa] >GROUP[-] TOKEN[] APP[map-reduce-wf] >JOB[0000985-131031060610521-oozie-root-W] >ACTION[0000985-131031060610521-oozie-root-W@:start:] >[***0000985-131031060610521-oozie-root-W@:start:***]Action updated in DB! >2013-11-18 23:20:41,479 INFO ActionStartXCommand:539 - USER[gaddepa] >GROUP[-] TOKEN[] APP[map-reduce-wf] >JOB[0000985-131031060610521-oozie-root-W] >ACTION[0000985-131031060610521-oozie-root-W@mr-node] Start action >[0000985-131031060610521-oozie-root-W@mr-node] with user-retry state : >userRetryCount [0], userRetryMax [0], userRetryInterval [10] >2013-11-18 23:20:42,497 WARN MapReduceActionExecutor:542 - USER[gaddepa] >GROUP[-] TOKEN[] APP[map-reduce-wf] >JOB[0000985-131031060610521-oozie-root-W] >ACTION[0000985-131031060610521-oozie-root-W@mr-node] credentials is null >for the action >2013-11-18 23:20:43,974 WARN ActionStartXCommand:542 - USER[gaddepa] >GROUP[-] TOKEN[] APP[map-reduce-wf] >JOB[0000985-131031060610521-oozie-root-W] >ACTION[0000985-131031060610521-oozie-root-W@mr-node] Error starting action >[mr-node]. ErrorType [TRANSIENT], ErrorCode [JA009], Message [JA009: null] >org.apache.oozie.action.ActionExecutorException: JA009: null > at >org.apache.oozie.action.ActionExecutor.convertExceptionHelper(ActionExecut >or.java:418) > at >org.apache.oozie.action.ActionExecutor.convertException(ActionExecutor.jav >a:396) > at >org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActio >nExecutor.java:739) > at >org.apache.oozie.action.hadoop.JavaActionExecutor.start(JavaActionExecutor >.java:889) > at >org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXComman >d.java:211) > at >org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXComman >d.java:59) > at org.apache.oozie.command.XCommand.call(XCommand.java:277) > at >org.apache.oozie.service.CallableQueueService$CompositeCallable.call(Calla >bleQueueService.java:326) > at >org.apache.oozie.service.CallableQueueService$CompositeCallable.call(Calla >bleQueueService.java:255) > at >org.apache.oozie.service.CallableQueueService$CallableWrapper.run(Callable >QueueService.java:175) > at >java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor. >java:886) > at >java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java >:908) > at java.lang.Thread.run(Thread.java:662) >Caused by: java.io.EOFException > at java.io.DataInputStream.readInt(DataInputStream.java:375) > at org.apache.hadoop.mapreduce.ID.readFields(ID.java:83) > at org.apache.hadoop.mapreduce.JobID.readFields(JobID.java:122) > at >org.apache.oozie.service.MaprJobClient.submitJob(MaprJobClient.java:292) > at >org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActio >nExecutor.java:724) > ... 10 more >2013-11-18 23:20:43,976 INFO ActionStartXCommand:539 - USER[gaddepa] >GROUP[-] TOKEN[] APP[map-reduce-wf] >JOB[0000985-131031060610521-oozie-root-W] >ACTION[0000985-131031060610521-oozie-root-W@mr-node] Next Retry, Attempt >Number [1] in [60,000] milliseconds > > >Our oozie server was running fine and we were able to submit jobs and they >were running . All of a sudden from 2 days we are getting this error. >Any known reason behind this error? > >Thanks, >Giridhar.
