[ 
https://issues.apache.org/jira/browse/TEZ-1961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14530410#comment-14530410
 ] 

Jeff Zhang commented on TEZ-1961:
---------------------------------

[~sseth] Indicating NoRunningDag via a valid RPC response looks a little 
complicated: I would have to add a flag to the RPC responses of 
getVertexStatus and getDAGStatus, and check that flag in both methods. 
After digging deeper, I found that this "No running dag" issue only happens in 
non-session mode. In session mode, each DAG submission returns its RPC response 
only after the DAG is set in DAGAppMaster. In non-session mode, however, we 
generate the dagId on the client side and don't wait for the DAG to be set in 
DAGAppMaster. I am working on a patch that returns the DAGClientImpl to the 
client only after the DAG has been set in DAGAppMaster. It is almost done; I 
still need to fix some test failures. 
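
To illustrate the idea, here is a rough, self-contained sketch of "wait until 
the DAG is set before handing a client back". It is not the patch and does not 
use any Tez classes: ToyAppMaster, ToyClient, waitForDag, isDagSet and the 
polling parameters are all hypothetical names made up for this example. 

```java
import java.util.concurrent.atomic.AtomicReference;

/** Hypothetical stand-in for the AM; NOT the real DAGAppMaster. */
class ToyAppMaster {
    private final AtomicReference<String> currentDagId = new AtomicReference<>();

    /** Called once the AM has actually registered the DAG. */
    void setCurrentDag(String dagId) {
        currentDagId.set(dagId);
    }

    /** True once the given DAG is the one set in the AM. */
    boolean isDagSet(String dagId) {
        return dagId.equals(currentDagId.get());
    }

    /** Mimics the failure mode in the log: a status query before the DAG is set fails. */
    String getDagStatus(String dagId) {
        if (!isDagSet(dagId)) {
            throw new IllegalStateException("No running dag at present");
        }
        return "RUNNING";
    }
}

/** Hypothetical client-side helper; NOT the real DAGClientImpl. */
class ToyClient {
    /**
     * Polls until the AM reports the DAG as set, or the timeout elapses.
     * Returns true if the DAG was set in time, false on timeout or interrupt.
     */
    static boolean waitForDag(ToyAppMaster am, String dagId,
                              long timeoutMillis, long pollMillis) {
        long deadline = System.nanoTime() + timeoutMillis * 1_000_000L;
        while (!am.isDagSet(dagId)) {
            if (System.nanoTime() >= deadline) {
                return false;
            }
            try {
                Thread.sleep(pollMillis);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve interrupt status
                return false;
            }
        }
        return true;
    }
}
```

With something along these lines, the client would only start calling 
getDAGStatus after waitForDag returns true, so the early query that produces 
the exception is never sent. 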



> Remove misleading exception "No running dag" from AM logs
> ---------------------------------------------------------
>
>                 Key: TEZ-1961
>                 URL: https://issues.apache.org/jira/browse/TEZ-1961
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Siddharth Seth
>            Priority: Critical
>         Attachments: TEZ-1961-1.patch
>
>
> {code}
> 15/01/14 16:45:06 INFO ipc.Server: IPC Server handler 0 on 51000, call 
> org.apache.tez.dag.api.client.rpc.DAGClientAMProtocolBlockingPB.getDAGStatus 
> from  Call#0 Retry#0
> org.apache.tez.dag.api.TezException: No running dag at present
>       at 
> org.apache.tez.dag.api.client.DAGClientHandler.getDAG(DAGClientHandler.java:84)
>       at 
> org.apache.tez.dag.api.client.DAGClientHandler.getACLManager(DAGClientHandler.java:151)
>       at 
> org.apache.tez.dag.api.client.rpc.DAGClientAMProtocolBlockingPBServerImpl.getDAGStatus(DAGClientAMProtocolBlockingPBServerImpl.java:94)
>       at 
> org.apache.tez.dag.api.client.rpc.DAGClientAMProtocolRPC$DAGClientAMProtocol$2.callBlockingMethod(DAGClientAMProtocolRPC.java:7375)
>       at 
> org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:617)
>       at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:962)
>       at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2041)
>       at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2037)
>       at java.security.AccessController.doPrivileged(Native Method)
>       at javax.security.auth.Subject.doAs(Subject.java:415)
>       at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
>       at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2035)
> 15/01/14 16:45:06 INFO client.DAGClientImpl: DAG initialized: 
> CurrentState=Running
> {code}
> This exception shows up fairly often and isn't very relevant - queries before 
> a DAG is submitted to the AM.
> This is very misleading, especially for folks new to Tez, and should be 
> removed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
