[ 
https://issues.apache.org/jira/browse/YARN-5620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun Suresh updated YARN-5620:
------------------------------
    Attachment: YARN-5620.009.patch

Updating patch.
* Addressing [~jianhe]'s latest comments
* some javadoc, checkstyle and javac fixes

bq. IIUC, in this case, the ContainerImpl will receive the KILL event first and 
move to the KILLING state, and the CONTAINER_KILLED_ON_REQUEST will be sent to 
the container at KILLING state..
It goes to the KILLING state only if the AM explicitly sends a kill signal or the 
RM asks the NM to kill the container. It is also possible that an admin logs into 
the NM and does a 'kill -9', which will also cause the ContainerLaunch to send 
CONTAINER_KILLED_ON_REQUEST, but the container won't be in the KILLING state in 
that case, right?
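
To make the two paths concrete, here is a toy sketch of the ordering argument. It is not the actual ContainerImpl transition table: the class name, the reduced set of states, and the terminal state names (KILLED_BY_REQUEST, KILLED_EXTERNALLY) are assumptions made up for illustration. Only KILLING and CONTAINER_KILLED_ON_REQUEST come from the discussion above.

```java
// Toy model only. State and event names are simplified assumptions and
// do NOT mirror the real ContainerImpl state machine.
class ContainerKillSketch {
    enum State { RUNNING, KILLING, KILLED_BY_REQUEST, KILLED_EXTERNALLY }
    enum Event { KILL_CONTAINER, CONTAINER_KILLED_ON_REQUEST }

    // Returns the next state for a (state, event) pair in this toy model.
    static State next(State current, Event event) {
        switch (current) {
            case RUNNING:
                // An AM/RM kill moves the container to KILLING first.
                // A bare CONTAINER_KILLED_ON_REQUEST (e.g. after an admin's
                // 'kill -9') arrives while the container is still RUNNING.
                return event == Event.KILL_CONTAINER
                        ? State.KILLING
                        : State.KILLED_EXTERNALLY;
            case KILLING:
                return event == Event.CONTAINER_KILLED_ON_REQUEST
                        ? State.KILLED_BY_REQUEST
                        : State.KILLING;
            default:
                // Terminal states ignore further events in this sketch.
                return current;
        }
    }

    // Feeds a sequence of events through the toy state machine.
    static State run(State start, Event... events) {
        State s = start;
        for (Event e : events) {
            s = next(s, e);
        }
        return s;
    }

    public static void main(String[] args) {
        // Path 1: AM or RM asks for the kill, so KILLING is visited.
        System.out.println(run(State.RUNNING,
                Event.KILL_CONTAINER, Event.CONTAINER_KILLED_ON_REQUEST));
        // Path 2: process killed from outside, KILLING is never visited.
        System.out.println(run(State.RUNNING,
                Event.CONTAINER_KILLED_ON_REQUEST));
    }
}
```

The point of the sketch is only that the handler for CONTAINER_KILLED_ON_REQUEST cannot assume the container is already in KILLING.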

bq. ..In testContainerUpgradeSuccess, could you make newStartFile a new upgrade 
resource, and verify the output is written into it, this verifies the part 
about the localization part as well.
Actually, if you look at the _prepareContainerUpgrade()_ function, we create a 
new script file *scriptFile_new*, which is passed into the 
_prepareContainerLaunchContext()_ function, which in turn associates the new file 
with a new *dest_file_new* location. This should verify that the upgrade needed a 
new localized resource. The output of the script is also written to a new 
*start_file_n.txt*, which we read and verify to check that the new process has 
actually started.

Also by the way:

bq. We can use the ResourceSet#getAllResourcesByVisibility method instead, and 
so the getLocalPendingRequests method and the new constructor in 
ContainerLocalizationRequestEvent is not needed
The problem with getAllResourcesByVisibility is that it gets all resources. I 
just need the pending resources. So if you are OK with it, I'd like to keep it 
as is.
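
A toy illustration of the difference, under loud assumptions: the class below is a stand-in, not the real ResourceSet, and its method names, the Visibility enum, and the use of plain strings for resources are all hypothetical. It only shows why "all resources" and "pending resources" diverge once the original launch has been localized and an upgrade adds a new file.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Toy stand-in for a resource set. NOT the real ResourceSet API.
class PendingResourcesSketch {
    enum Visibility { PUBLIC, PRIVATE, APPLICATION }

    // Every resource ever requested, grouped by visibility.
    private final Map<Visibility, List<String>> all = new LinkedHashMap<>();
    // Only resources that have not been localized yet.
    private final Map<Visibility, List<String>> pending = new LinkedHashMap<>();

    // A newly requested resource is tracked in both maps.
    void addResource(Visibility v, String name) {
        all.computeIfAbsent(v, k -> new ArrayList<>()).add(name);
        pending.computeIfAbsent(v, k -> new ArrayList<>()).add(name);
    }

    // Once localized, a resource leaves 'pending' but stays in 'all'.
    void markLocalized(Visibility v, String name) {
        List<String> p = pending.get(v);
        if (p != null) {
            p.remove(name);
            if (p.isEmpty()) {
                pending.remove(v);
            }
        }
    }

    Map<Visibility, List<String>> allByVisibility() { return all; }

    Map<Visibility, List<String>> pendingByVisibility() { return pending; }

    public static void main(String[] args) {
        PendingResourcesSketch rs = new PendingResourcesSketch();
        rs.addResource(Visibility.PRIVATE, "script_v1");   // initial launch
        rs.markLocalized(Visibility.PRIVATE, "script_v1"); // already on disk
        rs.addResource(Visibility.PRIVATE, "script_v2");   // added by upgrade
        System.out.println("all:     " + rs.allByVisibility());
        System.out.println("pending: " + rs.pendingByVisibility());
    }
}
```

In this sketch, a localization request built from the "all" view would ask for script_v1 again, while the "pending" view asks only for script_v2.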



> Core changes in NodeManager to support for upgrade and rollback of Containers
> -----------------------------------------------------------------------------
>
>                 Key: YARN-5620
>                 URL: https://issues.apache.org/jira/browse/YARN-5620
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Arun Suresh
>            Assignee: Arun Suresh
>         Attachments: YARN-5620.001.patch, YARN-5620.002.patch, 
> YARN-5620.003.patch, YARN-5620.004.patch, YARN-5620.005.patch, 
> YARN-5620.006.patch, YARN-5620.007.patch, YARN-5620.008.patch, 
> YARN-5620.009.patch
>
>
> JIRA proposes to modify the ContainerManager (and other core classes) to 
> support upgrade of a running container with a new {{ContainerLaunchContext}} 
> as well as the ability to rollback the upgrade if the container is not able 
> to restart using the new launch Context. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

