[ 
https://issues.apache.org/jira/browse/HDDS-1370?focusedWorklogId=225399&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-225399
 ]

ASF GitHub Bot logged work on HDDS-1370:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 10/Apr/19 04:22
            Start Date: 10/Apr/19 04:22
    Worklog Time Spent: 10m 
      Work Description: bharatviswa504 commented on pull request #715: 
HDDS-1370. Command Execution in Datanode fails becaue of NPE
URL: https://github.com/apache/hadoop/pull/715#discussion_r273784783
 
 

 ##########
 File path: 
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/container/common/states/datanode/RunningDatanodeState.java
 ##########
 @@ -86,7 +86,16 @@ public void execute(ExecutorService executor) {
     for (EndpointStateMachine endpoint : connectionManager.getValues()) {
       Callable<EndpointStateMachine.EndPointStates> endpointTask
           = getEndPointTask(endpoint);
-      ecs.submit(endpointTask);
+      if (endpointTask != null) {
+        ecs.submit(endpointTask);
+      } else {
+        // This can happen if a task is taking more time than the timeOut
+        // specified for the task in await, and when it is completed the task
+        // has set the state to Shutdown, we may see the state as shutdown
+        // here. So, we need to Shutdown DatanodeStateMachine.
+        LOG.error("State is Shutdown in RunningDatanodeState");
+        context.setState(DatanodeStateMachine.DatanodeStates.SHUTDOWN);
 
 Review comment:
   Added this because of the caller checks for datanode state. If endPoint 
state changed to shutdown, we need to stop this .
   
   Calling code:
    while (context.getState() != DatanodeStates.SHUTDOWN) {
         try {
           LOG.debug("Executing cycle Number : {}", 
context.getExecutionCount());
           long heartbeatFrequency = context.getHeartbeatFrequency();
           nextHB.set(Time.monotonicNow() + heartbeatFrequency);
           context.execute(executorService, heartbeatFrequency,
               TimeUnit.MILLISECONDS);
   
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 225399)
    Time Spent: 20m  (was: 10m)

> Command Execution in Datanode fails becaue of NPE
> -------------------------------------------------
>
>                 Key: HDDS-1370
>                 URL: https://issues.apache.org/jira/browse/HDDS-1370
>             Project: Hadoop Distributed Data Store
>          Issue Type: Bug
>          Components: Ozone Datanode
>    Affects Versions: 0.5.0
>            Reporter: Mukul Kumar Singh
>            Assignee: Bharat Viswanadham
>            Priority: Major
>              Labels: MiniOzoneChaosCluster, pull-request-available
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> The command execution on the datanode is failing with the following exception.
> {code}
> 2019-04-02 23:56:30,434 ERROR statemachine.DatanodeStateMachine 
> (DatanodeStateMachine.java:start(196)) - Unable to finish the execution.
> java.lang.NullPointerException
>         at 
> java.util.concurrent.ExecutorCompletionService.submit(ExecutorCompletionService.java:179)
>         at 
> org.apache.hadoop.ozone.container.common.states.datanode.RunningDatanodeState.execute(RunningDatanodeState.java:89)
>         at 
> org.apache.hadoop.ozone.container.common.statemachine.StateContext.execute(StateContext.java:354)
>         at 
> org.apache.hadoop.ozone.container.common.statemachine.DatanodeStateMachine.start(DatanodeStateMachine.java:183)
>         at 
> org.apache.hadoop.ozone.container.common.statemachine.DatanodeStateMachine.lambda$startDaemon$0(DatanodeStateMachine.java:338)
>         at java.lang.Thread.run(Thread.java:748)
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to