[ https://issues.apache.org/jira/browse/MAPREDUCE-2949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102590#comment-13102590 ]
Vinod Kumar Vavilapalli commented on MAPREDUCE-2949:
----------------------------------------------------

Thanks for the patch, Ravi! A couple of comments:

bq. I think LocalizerTracker should be under the ResourceLocalizationService as it's not generic enough to be made a separate service. I feel that moving the tasks to the service startup may be a good thing.

What I meant was that LocalizerTracker can live under ResourceLocalizationService but still extend {{AbstractService}} and thus implement the life-cycle properly. That's been the convention and discipline we've been adhering to in YARN :)

Granted, the localizerTracker is going away because of the NM's shutdown hook. One thing that is not clear is how the {{cacheCleanup}} executor-service is going away during JVM shutdown in your manual verification after the patch. Maybe it is a matter of timing, and the NM will probably shut down cleanly if there are no active threads in the pool? I'd think that we should do an explicit shutdown on the executor-service.

> NodeManager in a inconsistent state if a service startup fails.
> ---------------------------------------------------------------
>
>                 Key: MAPREDUCE-2949
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2949
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, nodemanager
>    Affects Versions: 0.24.0
>            Reporter: Ravi Teja Ch N V
>            Assignee: Ravi Teja Ch N V
>         Attachments: MAPREDUCE-2949.patch, Threaddump.txt
>
>
> When a service startup fails at the NodeManager, the NodeManager JVM does not
> exit, as the following threads are still running:
> Daemon Thread [Timer for 'NodeManager' metrics system] (Running)
> Thread [pool-1-thread-1] (Running)
> Thread [Thread-11] (Running)
> Thread [DestroyJavaVM] (Running)
> As a result, the NodeManager keeps running even though no services are
> started.
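
The explicit shutdown suggested in the comment above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the patch itself: the class {{CacheCleanupSketch}}, its {{start()}}, {{stop()}} and {{isTerminated()}} methods, and the 5-second timeout are all hypothetical, and the class merely mimics a start/stop life-cycle rather than extending YARN's {{AbstractService}}. It relies on the fact that threads created by the default {{Executors}} thread factory are non-daemon, so a pool that is never shut down can keep the JVM alive.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Hypothetical stand-in for a service that owns a cleanup executor.
// It is NOT the real ResourceLocalizationService.
public class CacheCleanupSketch {
    private ScheduledExecutorService cacheCleanup;

    // Create the pool at service start. The default thread factory
    // produces non-daemon threads named like "pool-1-thread-1".
    public synchronized void start() {
        cacheCleanup = Executors.newScheduledThreadPool(1);
        cacheCleanup.scheduleWithFixedDelay(
            () -> { /* placeholder for the periodic cache cleanup */ },
            1, 1, TimeUnit.SECONDS);
    }

    // Explicitly shut the pool down at service stop so that its
    // worker thread cannot keep the JVM running.
    public synchronized void stop() throws InterruptedException {
        if (cacheCleanup == null) {
            return; // start() was never called, or it failed early
        }
        cacheCleanup.shutdown(); // no new tasks; periodic task is cancelled
        // Assumed timeout: give in-flight work 5 seconds to finish.
        if (!cacheCleanup.awaitTermination(5, TimeUnit.SECONDS)) {
            cacheCleanup.shutdownNow(); // interrupt anything still running
        }
    }

    public synchronized boolean isTerminated() {
        return cacheCleanup == null || cacheCleanup.isTerminated();
    }

    public static void main(String[] args) throws InterruptedException {
        CacheCleanupSketch service = new CacheCleanupSketch();
        service.start();
        service.stop();
        System.out.println("terminated=" + service.isTerminated());
    }
}
```

The null check in {{stop()}} matters for the failure case described in this issue: if startup fails before the pool is created, stopping the service should still be safe to call.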