Hi,
Looks like you tried to kill the job  and it was not being killed. this could 
be different proble,

I want to see earlier log when you submitted the job initially.

Regards,
Mohammad



On Monday, November 11, 2013 1:17 AM, renguihe <[email protected]> wrote:
 
Sorry that I have not provided relevant info about the problem,now i will try 
to describe this problem clearly because it is a little confusing.
When there are 3 or more this unique workflow running simultaneously(actually 
in prep status instead of running status,i guess custom action implemented by 
custom runs its code in prep status ), any other workflow(has nothing to do 
with the unique wf) can not run,always hang on PREP status.
And the 3 or more  unique workflow can not be killed ,the oozie.log shows I 
have killed it many times but the kill command does not work.

2013-11-11 07:44:02,449  INFO DagEngine:539 - USER[-] GROUP[-] TOKEN[-] APP[-] 
JOB[-] ACTION[-] User hadoop killed the WF job 
0000001-131111070843260-oozie-hado-W
2013-11-11 07:44:12,458  INFO DagEngine:539 - USER[-] GROUP[-] TOKEN[-] APP[-] 
JOB[-] ACTION[-] User hadoop killed the WF job 
0000001-131111070843260-oozie-hado-W
2013-11-11 07:44:25,988  INFO DagEngine:539 - USER[-] GROUP[-] TOKEN[-] APP[-] 
JOB[-] ACTION[-] User hadoop killed the WF job 
0000001-131111070843260-oozie-hado-W
The content of this unique workflow is as follows:
<workflow-app xmlns='uri:oozie:workflow:0.1' name='customtest'> <start 
to='detectdir'/> <action name='detectdir'>   <detect 
xmlns="uri:custom:detect-action:0.1">     <dirname>aaa</dirname>   </detect>  
<ok to="end"/>  <error to="fail"/> </action> <end name='end'/> <kill 
name="fail">   <message>Workflow failed, error 
message[${wf:errorMessage(wf:lastErrorNode())}]</message> 
</kill></workflow-app>The detectdir node is a custom action node,implemented by 
myself.its logic is just to do a while circulation to wait for the dir 
exists,like this:while(!dirname.exists())     {}so if the dir does not exist 
and I submit this unique workflow and its copies 3 or more ,any other workflow 
can not run.But if one or two ,they can work normally.They wait for the dir and 
do not influence other workflows.    here are some relevant logs,please see if 
it can help:2013-11-11 08:51:38,938  INFO DagEngine:539 - USER[-] GROUP[-] 
TOKEN[-] APP[-] JOB[-] ACTION[-] User
 hadoop killed the WFjob 0000005-131111082227563-oozie-hado-W2013-11-11 
08:51:47,523  INFO DagEngine:539 - USER[-] GROUP[-] TOKEN[-] APP[-] JOB[-] 
ACTION[-] User hadoop killed the WFjob 
0000001-131111082227563-oozie-hado-W2013-11-11 08:52:11,742  WARN 
ActionStartXCommand:542 - USER[root] GROUP[-] TOKEN[] APP['customtest'] 
JOB[0000001-131111082227563-oozie-hado-W] 
ACTION[0000001-131111082227563-oozie-hado-W@simpleEmail] 
[***0000001-131111082227563-oozie-hado-W@simpleEmail***]Action 
status=DONE2013-11-11 08:52:11,743  WARN ActionStartXCommand:542 - USER[root] 
GROUP[-] TOKEN[] APP['customtest'] JOB[0000001-131111082227563-oozie-hado-W] 
ACTION[0000001-131111082227563-oozie-hado-W@simpleEmail] 
[***0000001-131111082227563-oozie-hado-W@simpleEmail***]Action updated in 
DB!2013-11-11 08:52:11,779  WARN ActionStartXCommand:542 - USER[root] GROUP[-] 
TOKEN[] APP['customtest'3] JOB[0000005-131111082227563-oozie-hado-W]
 ACTION[0000005-131111082227563-oozie-hado-W@simpleEmail] E0816: Action 
pending=[false], status=[KILLED]. Skipping ActionStart Execution, Error Code: 
E08162013-11-11 08:52:11,780  INFO KillXCommand:539 - USER[root] GROUP[-] 
TOKEN[] APP['customtest'] JOB[0000001-131111082227563-oozie-hado-W] ACTION[-] 
STARTED WorkflowKillXCommand for 
jobId=0000001-131111082227563-oozie-hado-W2013-11-11 08:52:11,818  WARN 
CoordActionUpdateXCommand:542 - USER[root] GROUP[-] TOKEN[] APP['customtest'] 
JOB[0000001-131111082227563-oozie-hado-W] ACTION[-] E1100: Command precondition 
does not hold before execution, [, coord action is null], Error Code: 
E11002013-11-11 08:52:11,818  INFO KillXCommand:539 - USER[root] GROUP[-] 
TOKEN[] APP['customtest'] JOB[0000001-131111082227563-oozie-hado-W] ACTION[-] 
ENDED WorkflowKillXCommand for 
jobId=0000001-131111082227563-oozie-hado-W2013-11-11 08:52:11,824  WARN 
ActionEndXCommand:542 - USER[root] GROUP[-] TOKEN[] APP['customtest']
 JOB[0000001-13111108222 Waiting for your help sincerely!
Best Wishes!






HENRY

From: Mohammad Islam
Date: 2013-11-11 11:52
To: [email protected]; renguihe
Subject: Re: Problems with Custom Action Nodes
Hi Henry,
Can you please share the relevant contents of oozie.log and yous workflow.xml?
These will help others to address the problem.

Regards,
Mohammad



On Saturday, November 9, 2013 7:13 PM, renguihe <[email protected]> wrote:

Hi,
I try to use Custom Action Nodes.
The workflow can be  submitted but hangs on "PREP" status.
I wonder which step is wrong.
I followed the steps according this article .
http://www.infoq.com/articles/ExtendingOozie

I put a jar containing EmailActionExecutor class and emailAction.xsd in the 
oozie server lib.
I wonder how xsd knows to use the    EmailActionExecutor  class?
There is no expicit declaration in xsd.

thx! 




HENRY

Reply via email to