[ 
https://issues.apache.org/jira/browse/AMBARI-5589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13982962#comment-13982962
 ] 

Dmytro Shkvyra commented on AMBARI-5589:
----------------------------------------

STR:
1) Install 2 node cluster (minimum) with HDFS and YARN. Check http://<ambari 
hosname>:8080/api/v1/clusters/<cluster 
name>/services/YARN/components/NODEMANAGER. Request responce should have list 
NM host_components and status of NM should be STARTED 
2) In menu "Hosts">"All Hosts">"Node Manager" on Hosts perspective select 
Decommision menu Item
3) On host, which has ResourceManager check /etc/hadoop/yarn.exclude file. In 
this file should be present list of hosts where NodeManagers have been 
decommited.
4) Go to Host perspective and select in menu of decommited NM "Delete" menu 
item 
5) Check http://<ambari hosname>:8080/api/v1/clusters/<cluster 
name>/services/YARN/components/NODEMANAGER. There is host with deleted NM 
should be dissapeared. Also check  /etc/hadoop/yarn.exclude file. Hostname 
where was deleted NM host component should be dissapeared also. 
6) Repeat step 5 for all hosts where was installed NM hostcomponent
7) Check http://<ambari hosname>:8080/api/v1/clusters/<cluster 
name>/services/YARN/components/NODEMANAGER. There is not should be any hosts in 
 host_components list and status of NM service componet should be INIT. Check 
also  /etc/hadoop/yarn.exclude on host with ResouceManager. This file should be 
empty.
8) Add NM back on some hosts
9) Start  NM's and run smoke tests
*Desired result:*
NM's were started successfuly and smoke test passed. 

> Cannot start NodeManager after decommissioning, deleting, and then adding it 
> back
> ---------------------------------------------------------------------------------
>
>                 Key: AMBARI-5589
>                 URL: https://issues.apache.org/jira/browse/AMBARI-5589
>             Project: Ambari
>          Issue Type: Bug
>          Components: controller
>    Affects Versions: 1.6.0
>            Reporter: Dmytro Shkvyra
>             Fix For: 1.6.1
>
>
> I decommissioned a NodeManager on a host, and then deleted it.
> Then I added NodeManager back to the same host.
> NodeManager will no longer start on this host, because from the 
> ResourceManager's perspective, this host is still in the exclude list.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to