[ 
https://issues.apache.org/jira/browse/TEZ-3834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Eagles updated TEZ-3834:
---------------------------------
    Attachment: TEZ-3834.003.patch

Thanks for the review, [~jlowe]. I put up a new 003 patch based on your comment.
Let me know if we need to tidy this code up. Also, if one registration fails,
should we stop trying to unregister the remaining services, as in the current
behavior?
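
To make the question concrete, here is a minimal sketch of the alternative, where shutdown skips schedulers that were never created and keeps going after one of them fails. All names below (SchedulerShutdownSketch, Scheduler, stopAll) are hypothetical and only for illustration; this is not the patch and not the actual TaskSchedulerManager code.

```java
import java.util.ArrayList;
import java.util.List;

public class SchedulerShutdownSketch {

  /** Hypothetical stand-in for a scheduler wrapper; not the Tez class. */
  interface Scheduler {
    void initiateStop() throws Exception;
  }

  /**
   * Stops every scheduler that was actually created. Null slots (start-up
   * failed before the slot was filled) are skipped, and a failure in one
   * scheduler does not prevent the remaining ones from being stopped.
   * Returns the indexes of the schedulers that failed to stop.
   */
  static List<Integer> stopAll(Scheduler[] schedulers) {
    List<Integer> failed = new ArrayList<>();
    if (schedulers == null) {
      return failed; // nothing was initialized, so nothing to stop
    }
    for (int i = 0; i < schedulers.length; i++) {
      Scheduler s = schedulers[i];
      if (s == null) {
        continue; // this scheduler never started
      }
      try {
        s.initiateStop();
      } catch (Exception e) {
        failed.add(i); // remember the failure, keep stopping the rest
      }
    }
    return failed;
  }
}
```

With this shape the caller can log or rethrow based on the returned indexes after every scheduler has had a chance to stop. Whether that is preferable to stopping at the first failure is the open question above.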

> TaskSchedulerManager NullPointerException during shutdown when failed to start
> ------------------------------------------------------------------------------
>
>                 Key: TEZ-3834
>                 URL: https://issues.apache.org/jira/browse/TEZ-3834
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Jonathan Eagles
>            Assignee: Jonathan Eagles
>         Attachments: TEZ-3834.001.patch, TEZ-3834.002.patch, TEZ-3834.003.patch
>
>
> {noformat:title=NPE 1}
> 2017-09-14 12:16:48,259 [ERROR] [main] |rm.TaskSchedulerManager|: Failed to do a clean initiateStop for Scheduler: [0:TezYarn]
> java.lang.NullPointerException
>       at org.apache.tez.dag.app.rm.TaskSchedulerManager.initiateStop(TaskSchedulerManager.java:696)
>       at org.apache.tez.dag.app.DAGAppMaster.initiateStop(DAGAppMaster.java:2223)
>       at org.apache.tez.dag.app.DAGAppMaster.serviceStop(DAGAppMaster.java:2239)
>       at org.apache.hadoop.service.AbstractService.stop(AbstractService.java:221)
>       at org.apache.hadoop.service.ServiceOperations.stop(ServiceOperations.java:52)
>       at org.apache.hadoop.service.ServiceOperations.stopQuietly(ServiceOperations.java:80)
>       at org.apache.hadoop.service.AbstractService.start(AbstractService.java:203)
>       at org.apache.tez.dag.app.DAGAppMaster$9.run(DAGAppMaster.java:2707)
>       at java.security.AccessController.doPrivileged(Native Method)
>       at javax.security.auth.Subject.doAs(Subject.java:422)
>       at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1936)
>       at org.apache.tez.dag.app.DAGAppMaster.initAndStartAppMaster(DAGAppMaster.java:2703)
>       at org.apache.tez.dag.app.DAGAppMaster.main(DAGAppMaster.java:2508)
> {noformat}
> {noformat:title=NPE 2}
> 2017-09-14 12:16:48,610 [ERROR] [main] |rm.TaskSchedulerManager|: Error in TaskScheduler when checking if a scheduler has unregistered, scheduler=[0:TezYarn]
> java.lang.NullPointerException
>       at org.apache.tez.dag.app.rm.TaskSchedulerManager.hasUnregistered(TaskSchedulerManager.java:998)
>       at org.apache.tez.dag.app.DAGAppMaster.serviceStop(DAGAppMaster.java:2252)
>       at org.apache.hadoop.service.AbstractService.stop(AbstractService.java:221)
>       at org.apache.hadoop.service.ServiceOperations.stop(ServiceOperations.java:52)
>       at org.apache.hadoop.service.ServiceOperations.stopQuietly(ServiceOperations.java:80)
>       at org.apache.hadoop.service.AbstractService.start(AbstractService.java:203)
>       at org.apache.tez.dag.app.DAGAppMaster$9.run(DAGAppMaster.java:2707)
>       at java.security.AccessController.doPrivileged(Native Method)
>       at javax.security.auth.Subject.doAs(Subject.java:422)
>       at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1936)
>       at org.apache.tez.dag.app.DAGAppMaster.initAndStartAppMaster(DAGAppMaster.java:2703)
>       at org.apache.tez.dag.app.DAGAppMaster.main(DAGAppMaster.java:2508)
> {noformat}


