tysonnorris commented on a change in pull request #4109: update mesos-actor; 
cleanup orphaned failed task launches
URL: 
https://github.com/apache/incubator-openwhisk/pull/4109#discussion_r233036814
 
 

 ##########
 File path: 
common/scala/src/main/scala/org/apache/openwhisk/core/mesos/MesosTask.scala
 ##########
 @@ -142,9 +140,14 @@ object MesosTask {
           transid.finished(this, start, s"launched task ${taskId} at 
${taskDetails.hostname}:${taskDetails
             .hostports(0)}", logLevel = InfoLevel)
         case Failure(ate: AskTimeoutException) =>
-          transid.failed(this, start, ate.getMessage, ErrorLevel)
+          transid.failed(this, start, s"task launch timed out 
${ate.getMessage}", ErrorLevel)
           
MetricEmitter.emitCounterMetric(LoggingMarkers.INVOKER_MESOS_CMD_TIMEOUT(LAUNCH_CMD))
-        case Failure(t) => transid.failed(this, start, t.getMessage, 
ErrorLevel)
+          //kill the task whose launch timed out
+          destroy(mesosClientActor, mesosConfig, taskId)
+        case Failure(t) =>
+          //kill the task whose launch timed out
+          destroy(mesosClientActor, mesosConfig, taskId)
 
 Review comment:
   In this case destroy is just cleanup that is mesos-specific, so it should 
happen independent of the create future; i.e. `create()` timeout should return 
immediately, but trigger the cleanup of `destroy()`

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

Reply via email to